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TUMOUR THERAPY 

The present invention relates to the therapy of tumours, particularly 
melanomas. 

5 

Biological therapy of cancer, based upon the adoptive transfer of modified 
immune cells, seeks to exploit in vivo specificity to deliver recombinant 
proteins directly to the tumour mass (Parmiani et al (1992) Trends Exp. 
Clin. Med. 2, 412-419; Rosenberg (1992)7. Clin. Oncol. 10, 180-100). 

10 However, this approach involves removal of cells from the patient 
followed by their in vitro manipulation and replacement in vivo. Proposed 
vaccination experiments using genetically modified tumour cells also 
require a similar period of passage in vitro during which time the 
neoplastic cells may significantly alter their immunological properties or 

15 growth characteristics (Rosenberg (1992) he. cit.; Roemer & Friedmann 
(1992) Eur. J. Biochem. 208, 211-225; Pardoll (1992) Curr. Opin. 
Immunol 4, 619-623); Fearon et al (1990) CeU 60, 397-403. 

There is experimental evidence that the expression of cytokines in tumour 
* 20 cells (following transfection with cytokine cDNA in vitro) leads to 
rejection of otherwise tumourigenic doses of tumour cells and, in some 
cases, can immunise animals against established diseases when the 
transfected cells are injected into the animal. Cytokines shown to have 
this effect include interleukin-2, interIeukin-4, interferon-7, tumour 
25 necrosis factor and interIeukin-7. This information is summarised in 
Pardoll (1992) Curr. Opinion Immunol 4, 619-623. 

CD28-positive T cell responses, and immune responses mediated by T 
cells, may be regulated by the B7 antigen as described in WO 92/00092. 
30 Also, tumour rejection after direct costimulation of CD8 + T cells by B7- 
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transfected melanoma cells is described in Townsend & Allison (1993) 
Science 259, 368-370. 

Malignant melanoma represents a cancer the growth and dissemination of 
5 which may be altered significantly by immunological manipulation. Many 
melanomas synthesise the pigment melanin, which is otherwise produced 
almost exclusively by melanocytes (Hearing & Tsukamoto (1991) FASEB 
J. 5, 2902-2909) and indeed several workers have proposed utilising the 
melanin synthetic pathway for chemotherapeutic intervention (Riley (1991) 
10 Eur. J. Cancer 27, 1 172-1 179; link & Carpenter (1992) Cancer Res. 52, 
4385-4390). 

The tyrosinase and TRP-1 genes both encode proteins which play key 
roles in the synthesis of the pigment melanin, a specific product of 
IS melanocytic cells. Our aim has been to utilise the 5' ends of the 
tyrosinase and tyrosinase-related protein" (TRP-1) genes to confer tissue 
specificity of expression on genes cloned downstream of these promoter 
elements for therapeutic purposes. 

20 A number of other groups already have shown that tissue specificity of 
expression resides within the 5' sequences of these genes (eg Bradl, M. 
et a! (1991) Proc. Natl. Acad. Sci. USA 88, 164-168; Jackson, LJ. et al 
(1991) Nucleic Acids Res. 19, 3799-3804). However we have confirmed 
and expanded these findings and used the promoters of these genes for 

25 therapeutic purposes. 

Prostate-specific antigen (PSA) is one of the major protein constituents of 
the human prostate secretion. It has become a useful marker for the 
detection and monitoring of prostate cancer. Other groups have 
30 characterised the gene encoding PSA and have identified the promoter 
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region which directs the prostate-specific expression of PSA (Lundwali 
(1989) Biochem. Biophys. Res. Comm. 161, 1151-1159; Riegman et al 
(1989) Biochem. Biophys. Res. Comm. 159, 95-102; Brawer (1991) Acta 
Oncol. 30, 161-168). 

5 

Carcinoembryonic antigen (CEA) is a widely used tumour marker, 
especially in the surveillance of colonic cancer patients. Although CEA 
is also present in some normal tissues, it is apparently expressed at higher 
levels in tumorous tissues than in corresponding normal tissues. The 

10 complete gene encoding CEA has been cloned and its promoter region 
analysed. A CEA gene promoter construct, containing approximately 400 
nucleotides upstream from the translational start, showed nine times higher 
activity in the adenocarcinoma cell line SW303, compared with the HeLa 
cell line. .This indicates that exacting sequences which convey cell type 

15 specific expression are contained within this region (Schrewe et al (1990) 
Mol. CeU. Biol. 10, 2738-2748). 

The c-er&B-2 gene and promoter have been characterised previously and 
the gene product has been shown to be over-expressed in tumour cell lines 
20 (Kraus et al (1987) EMBO J. 6, 605-610). 

The mucin gene, MUC1, contains 5' flanking sequences which are able 
to direct expression selectively in breast and pancreatic cell lines, but not 
in non-epithelial cell lines as taught in WO 91/09867. 

25 

Summary of the Invention 

One aspect of the invention provides a DNA construct comprising (i) 
means for expression of a coding sequence in a tumour cell and (ii) a said 
30 coding sequence encoding a cytokine. 
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Expression of the cytokine in the tumour cells is believed to stimulate 
attack by T cells, especially LAK cells. Such T cells will then destroy not 
only the primary tumour but also any secondary (metastatic) growths. 

5 The tumour may be a melanoma, or a tumour of the breast, colon, brain, 
pancreas, bladder, skin, prostate, stomach, oesophagus or liver, for 
example. Preferably, it is a melanoma. 

Advantageously, the said means for expression provides for specific 
10 expression selectively in tumour cells. Otherwise, the T cells-may attack 
normal cells and/or the germ line may be altered. 

By "specific expression selectively in tumour cells" we mean that the 
expression is usefully higher (for example 2X, 5X, 10X or at least 20X 

IS higher) in tumour cells compared to the expression in non-tumour cells. 
It will be appreciated by those skilled in the art that tumour selective 
expression may be derived from tissue-specific expression where the 
tumour rapidly grows from a specific tissue type. Alternatively, highly 
specific delivery of a non-specific expression construct may be adequate. 

20 Known means such as targeted liposomes (carrying anti-tumour-marker 
antibodies) and viruses, including retroviruses, may be employed. 



The constructs of the invention may be introduced into the tumour cells 
by any convenient method, for example methods involving retroviruses, 

25 so that the construct is inserted into the genome of the tumour cell. For 
example, in Kuriyama et al (1991) Cell Struc. and Func. 16, 503-510 
purified retroviruses are administered. Retroviruses provide a potential 
means of selectively infecting cancer cells because they can only integrate 
into the genome of dividing cells; most normal cells surrounding cancers 

30 are in a quiescent, non-receptive stage of cell growth. Retroviral DNA 



constructs which contain a promoter segment and a cytokine coding 
sequence may be made using methods well known in the art To produce 
active retrovirus from such a construct it is usual to use an ecotropic psi2 
packaging cell line grown in Dulbecco's modified Eagle's medium 
(DMEM) containing 10% foetal calf serum (FCS). Transfection of the 
cell line is conveniently by calcium phosphate co-precipitation, and stable 
transformants are selected by addition of G418 to a final concentration of 
1 mg/ml (assuming the retroviral construct contains a necP gene)* 
Independent colonies are isolated and expanded and the culture supernatant 
removed, filtered through a 0.45 /*m pore-size filter and stored at r70°. 
For the introduction of the retrovirus into the tumour cells, it is convenient 
to inject directly retroviral supernatant to which 10 /xg/ml Polybrene has 
been added. For tumours exceeding 10 mm in diameter it is appropriate 
to inject between 0. 1 ml and 1 ml of retroviral supernatant; preferably 0.5 
ml. Alternatively, as described in Culver et al (1992) Science 256, 1550- 
1552, cells which produce retroviruses are injected into the tumour. The 
retrovirus-producing cells so introduced are engineered to actively produce 
retroviral vector particles so that continuous productions of the vector 
occurred within the tumour mass in situ. Thus, proliferating tumour cells 
can be successfully transduced in vivo if mixed with retroviral vector- 
producing cells. Other methods involve simple delivery of the construct 
into the cell for expression therein either for a limited time or, following 
integration into the genome, for a longer time. An example of the latter 
approach includes (preferably tumour-cell-targeted) liposomes (Nassander 
et al (1992) Cancer Res. 52, 646-653). 

Immunoliposomes (antibody-directed liposomes) are especially useful in 
targeting to cancer cell types which over-express a cell surface protein for 
which antibodies are available. In relation to the present invention, 
antibodies directed towards tumour cell antigens such as CEA and PSA 
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are preferred. For the preparation of immuno-liposomes MPB-PE (N-[4- 
(p-maleimidophenyl)butyryl]-phosphatidylethanolamine) is synthesised 
according to the method of Martin & Papahadjopoulos (1982) 7. Biol 
Chertu 257, 286-288. MPB-PE is incorporated into the liposomal bilayers 

5 to allow a covalent coupling of the antibody, or fragment thereof, to the 
liposomal surface. The liposome is conveniently loaded with the DNA 
construct of the invention for delivery to the target cells, for example, by 
forming the said liposomes in a solution of the DNA construct, followed 
by sequential extrusion through polycarbonate membrane filters with 0.6 

10 fim and 0.2 ftm pore size under nitrogen pressures up to 0.8 MPa. After 
extrusion, entrapped DNA construct is separated from free DNA construct 
by ultracentrifugation at 80 000 x g for 45 min. Freshly prepared MPB- 
PE-liposomes in deoxygenated buffer are mixed with freshly prepared 
antibody (or fragment thereof) and the coupling reactions are carried out 

IS in a nitrogen atmosphere at 4°C under constant end over end rotation 
overnight. The immunoliposomes are separated from unconjugated 
antibodies by ultracentrifugation at 80 000 x g for 45 min. 
Immunoliposomes may be injected intraperitoneally or directly into the 
tumour. 

20 

It will be appreciated that monoclonal antibodies or other molecules that 
bind to tumour cell surface antigens are useful in targeting the DNA 
construct of the invention. 

25 Monoclonal antibodies which will bind to many of these antigens are 
already known but in any case, with today's techniques in relation to 
monoclonal antibody technology, antibodies can be prepared to most 
antigens. The antigen-binding portion may be a part of an antibody (for 
example a Fab fragment) or a synthetic antibody fragment (for example 

30 a single chain Fv fragment [ScFv]). Suitable monoclonal antibodies to 
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selected antigens may be prepared by known techniques, for example 
those disclosed in "Monoclonal Antibodies: A manual of techniques", H 
Zola (CRC Press, 1988) and in "Monoclonal Hybridoma Antibodies: 
Techniques and Applications*, J G R Hurrell (CRC Press, 1982). 

5 

Chimaeric antibodies are discussed by Neuberger et al (1988, 8th 
International Biotechnology Symposium Part 2, 792-799). 

Suitably prepared non-human antibodies can be "humanized" in known 
10 ways, for example by inserting the CDR regions of mouse antibodies into 
the framework of human antibodies. Such "humanized" antibodies, or 
fragments thereof, are preferred as they may give rise to a lower anti- 
antibody reaction than rodent antibodies. 

15 The variable heavy (V H ) and variable light (VY) domains of the antibody 
are involved in antigen recognition, a" fact first recognised by early 
protease digestion experiments. Further confirmation was found by 
"humanisation" of rodent antibodies. Variable domains of rodent origin 
may be fused to constant domains of human origin such that the resultant 

20 antibody retains the antigenic specificity of the rodent parented antibody 
(Morrison et al (1984) Proc. Natl. Acad. Sci. USA 81, 6851-6855). 

That antigenic specificity is conferred by variable domains and is 
independent of the constant domains is known from experiments involving 

25 the bacterial expression of antibody fragments, all containing one or more 
variable domains. These molecules include Fab-like molecules (Better et 
al (1988) Science 240, 1041); Fv molecules (Skerra et al (1988) Science 
240, 1038); single-chain Fv (ScFv) molecules where the V H and V L 
partner domains are linked via a flexible oligopeptide (Bird et al (1988) 

30 Science 242, 423; Huston et al (1988) Proc. Natl Acad. Sci. USA 85, 
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5879) and single domain antibodies (dAbs) comprising isolated V domains 
(Ward et al (1989) Nature 341, 544). A general review of the techniques 
involved in the synthesis of antibody fragments which retain their specific 
binding sites is to be found in Winter & MUstein (1991) Nature 349, 293- 
5 299. 

By a ScFv molecules" we mean molecules wherein the V H and V L partner 
domains are linked via a flexible oligopeptide. 

10 The advantages of using antibody fragments, rather than whole antibodies, 
are several-fold. The smaller size of the fragments may lead to improved 
pharmacological properties, such as better penetration of solid tissue. 
Effector functions of whole antibodies, such as complement binding, are 
removed. Fab, Fv, ScFv and dAb antibody fragments can all be 

15 expressed in and secreted from E. coli, thus allowing the facile production 
of large amounts of the said fragments. 

Whole antibodies, and F(ab*) 2 fragments are "bivalent". By "bivalent" 
we mean that the said antibodies and F(ab*) 2 fragments have two antigen 
20 combining sites. In contrast, Fab, Fv, ScFv and dAb fragments are 
monovalent, having only one antigen combining sites. 

Other molecules immunologically reactive with the target cell surface 
molecule are also useful in this aspect of the invention and include, for 
25 example minimal recognition units (MRU) and complementarity 
determining regions. 

Other methods of delivery include adenoviruses carrying external DNA via 
an antibody-polylysine bridge (see Curiel Prog. Med. Virol. 40, 1-18) and 
30 transferrin-polycation conjugates as carriers (Wagner et al (1990) Proc. 
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Natl Acad. Set USA 87, 3410-3414). In the first of these methods a 
polycation-antibody complex is formed with the DNA construct of the 
invention, wherein the antibody is specific for either wild-type adenovirus 
or a variant adenovirus in which a new epitope has been introduced which 
5 binds the antibody. The polycation moiety binds the DNA via electrostatic 
interactions with the phosphate backbone. The adenovirus, because it 
contains unaltered fibre and pentos proteins, is internalized into the cell 
and carries into the cell with it the DNA construct of the invention. It is 
preferred if the polycation is polylysine. 

10 

In the second of these methods, a high-efficiency nucleic acid delivery 
system that uses receptor-mediated endocytosis to carry DNA 
macromolecules into cells is employed. This is accomplished by 
conjugating the iron-transport protein transferrin to polycations that bind 

15 nucleic acids. Human transferrin, or the chicken homologue conalbumin, 
or combinations thereof is covalently linked to the small DNA-binding 
protein protamine or to polylysines of various sizes through a disulfide 
linkage. These modified transferrin molecules maintain their ability to 
bind their cognate receptor and to mediate efficient iron transport into the 

20 cell. The transferrin-polycation molecules form electrophoretically stable 
complexes with DNA constructs of the invention independent of nucleic 
acid size (from short oligonucleotides to DNA of 21 kilobase pairs). 
When complexes of transferrin-polycation and the DNA constructs of the 
invention are supplied to the tumour cells, a high level of expression from 

25 the construct in the cells is expected. 

High-efficiency receptor-mediated delivery of the DNA constructs of the 
invention using the endosome-disruption activity of defective or chemically 
inactivated adenovirus particles produced by the methods of Cotten et al 
30 (1992) Proc. Natl Acad. Scl USA 89, 6094-6098 may also be used. This 
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approach appears to rely on the fact that adenoviruses are adapted to allow 
release of their DNA from ah endosome without passage through the 
lysosome, and in the presence of, for example transferrin linked to the 
DNA construct of the invention, the DNA construct is taken up by the cell 
5 by the same route as the adenovirus particle. 

It may be desirable to locally perfuse a tumour with the delivery vehicle 
(for example the retrovirus) for a period of time. 

10 In one embodiment of the invention the said means for expression 
provides for specific expression selectively in melanoma cells or in 
melanoma cells and melanocytes. In this embodiment the said means for 
expression is a promoter or an analogue or part thereof forming part of a 
gene expressed substantially exclusively in the melanin synthesis pathway. 

15 Examples of such promoters include the tyrosinase gene promoter and the 
tyrosinase-related protein (TRP-1) gene promoter. 

By "promoter" we mean that region of DNA which controls, at least to 
a substantial extent, the transcription of the coding region associated with 
20 that region of DNA. 

In a further embodiment of the invention the said means for expression 
provides for specific expression selectively in prostate cancer cells or 
prostate cancer cells and prostate cells. In this embodiment the said 
25 means for expression is a promoter or an analogue or part thereof forming 
part of a gene expressed substantially exclusively in prostate cancer or 
prostate cells. An example of such a promoter is the prostate-specific 
antigen (PSA) gene promoter. 

30 In a still further embodiment of the invention the said means for 
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expression provides for specific expression selectively in colonic cancer 
cells, or colonic cancer cells and colon cells. In this embodiment the said 
means for expression is a promoter or an analogue or part thereof forming 
part of a gene expressed substantially exclusively in colon cancer or colon 
5 cells. An example of such a promoter is the carcinoembryonic antigen 
(CEA) gene promoter. 

In another embodiment of the invention the said means for expression is 
provided by the promoter region of the c-e/£B2-gene. 

10 

In this embodiment the constructs comprising the c-erbB2 gene promoter 
fused to the cytokine coding sequence may be usefully delivered to breast 
tumours. The c-erbB3 gene promoter may also be used. 

IS In yet another embodiment the said means for expression is provided by 
the promoter region of the MUC1 gene. 

In this embodiment pancreatic or breast tumours may usefully receive the 
constructs comprising MUC1 gene promoter fused to the cytokine coding 
20 sequence. 

DNA sequences encompassing the promoter sequences useful in the 
invention are given in the sequence listing. 

25 The cytokine is preferably interIeukin-2 or interleukin-4 or macrophage 
colony stimulating factor. Other cytokines may, however, be used, for 
example interferon-?, tumour necrosis factor, and interleukin-7. 
Nucleotide coding sequences for these are known and are given in the 
sequence listing. 

30 
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The promoter is joined to the cytokine coding region and placed in a 
suitable vector system for propagation. The skilled person can use the 
information given below containing the promoter DNA sequences and 
coding sequences of some of the cytokines useful in the invention to make 
5 suitable constructs. For example, a knowledge of the DNA sequences 
provides information on where restriction enzyme will cleave the said 
DNA molecules and allows oligonucleotide primers to be designed for 
PCR amplification and site-directed mutagenesis. 

10 The vector is then introduced into the host through standard techniques. 
Generally, not all of the hosts will be transformed by the vector. 
Therefore, it will be necessary to select for transformed host cells. One 
selection technique involves incorporating into the vector a DNA 
sequence, with any necessary control elements, that codes for a selectable 

15 trait in the transformed cell, such as antibiotic resistance. Alternatively, 
the gene for such selectable trait can be on another vector, which is used 
to co-transform the desired host cell. 

Host cells that have been transformed by the recombinant DNA construct 
20 of the invention are then cultured for a sufficient time and under 
appropriate conditions known to those skilled in the art in view of the 
teachings disclosed herein to permit the propagation of the DNA construct, 
which can then be recovered. 

25 The vectors usually include a procaryotic replicon, such as the ColEl ori 9 
for propagation in a procaryote, even if the vector is to be used for 
expression in other, non-procaryotic, cell types. 

It is preferred if the host cell is E. coli. 

30 
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A variety of methods have been developed to operatively link DNA to 
vectors via complementary cohesive termini. For instance, 
complementary homopolymer tracts can be added to the DNA segment to 
be inserted to the vector DNA. The vector and DNA segment are then 
5 joined by hydrogen bonding between the complementary homopolymeric 
tails to form recombinant DNA molecules* 

Synthetic linkers containing one or more restriction sites provide an 
alternative method of joining the DNA segment to vectors. Hie DNA 
10 segment, generated by endonudease restriction digestion as described 
earlier, is treated with bacteriophage T4 DNA polymerase or E. coli DNA 
polymerase I, enzymes that remove protruding, 3 '-single-stranded termini 
with their 3'-5'-exonucleoIytic activities, and fill in recessed 3 '-ends with 
their polymerizing activities. 

15 

The combination of these activities therefore generates blunt-ended DNA 
segments. The blunt-ended segments are then incubated with -a large 
molar excess of linker molecules in the presence of an enzyme that is able 
to catalyze the ligation of blunt-ended DNA molecules, such as 
20 bacteriophage T4 DNA ligase. Thus, the products of the reaction are 
DNA segments carrying polymeric linker sequences at their ends. These 
DNA segments are then cleaved with the appropriate restriction enzyme 
and ligated to an expression vector that has been cleaved with an enzyme 
that produces termini compatible with those of the DNA segment. 

25 

Synthetic linkers containing a variety of restriction endonudease sites are 
commercially available from a number of sources including International 
Biotechnologies Inc, New Haven, CN, USA. 

30 DNA fragments with complementary cohesive termini are readily joined 
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Sambrook et al (1989) Molecular Cloning, A laboratory manual, 2nd 
Edition, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New 
York. 

5 

A desirable way to modify the promoter fragment, vector or coding region 
to be fused in the DNA construct is to use the polymerase chain reaction 
as disclosed by Saiki et al (1988) Science 239, 487-491. 

10 In this method the DNA to be enzymatically amplified is flanked by two 
specific oligonucleotide primers which themselves become incorporated 
into the amplified DNA. The said specific primers may contain restriction 
endonuclease recognition sites which can be used for cloning into 
expression vectors using methods known in the art. 

15 

The present invention also relates to a host cell transformed with a 
polynucleotide vector construct of the present invention. The host cell for 
propagating the DNA construct can be either procaryotic or eucaryotic. 
Bacterial cells are preferred host cells and typically are a strain of E. coli 
20 such as, for example, the E. colt strains DH5 available from Bethesda 
Research Laboratories Inc., Bethesda, MD, USA, and RR1 available from 
the American Type Culture Collection (ATCQ of Rockville, MD, USA 
(No ATCC 31343). 

25 Transformation of appropriate cell hosts with a DNA construct of the 
present invention is accomplished by well known methods that typically 
depend on the type of vector used. With regard to transformation of 
bacterial, especially E. coli host cells, see, for example, Cohen et al 
(1972) Proc. Natl. Acad. Sci. USA 69, 2110 and Sambrook et al (1989) 

30 Molecular Cloning, A Laboratory Manual, Cold Spring Harbor 
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Laboratory, Cold Spring Harbor, NY. 

Successfully transformed cells, ie cells that contain a DNA construct of 
the present invention, can be identified by well known techniques. For 
5 example, cells can be harvested and lysed and their DNA content 
examined for the presence of the DNA using a method such as that 
described by Southern (1975) /. Mol. Biol 98, 503 or Berent et al (1985) 
Biotech. 3, 208 or by isolating the plasmid vector DNA and then digesting 
the said plasmid appropriate restriction enzymes that give diagnostic DNA 
10 fragments that can be separated and sized by gel electrophoresis. 

The DNA construct of the invention is purified from the host cell using 
well known methods. 

15 For example, plasmid vector DNA can be prepared on a large scale from 
cleaved lysates by banding in a CsCl gradient according to the methods of 
Clewell & Helinski (1970) Biochemistry 9, 4428-4440 and CleweH (1972) 
/. Bacterid 110, 667-676. Plasmid DNA extracted in this way can be 
freed from CsCl by dialysis against sterile, pyrogen-free buffer through 

20 Visking tubing or by size-exclusion chromatography. 

Alternatively, plasmid DNA may be purified from cleared lysates using 
ion-exchange chromatography, for example those supplied by Qiagen. 
Hydroxyapatite column chromatography may also be used. 

25 

Preferably, naked DNA is injected in the tumour, for example at a dose 
of 0. 1 ng to 1 .0 mg vector DNA cm" 3 of tumour, preferably about 0. 1 -10 
fig cm" 3 vector DNA. The DNA may be circular or linear. linear DNA 
may be obtained from circular DNA by cleavage with an appropriate 
30 restriction enzyme. 
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By "appropriate restriction enzyme* we mean one that does not cleave the 
DNA within the promoter region or cytokine coding region. 

At present, it is most preferable to use 1 -0 ng of DNA per cm 3 of tumour 
5 in a volume of 100 pi. The DNA may be dissolved in phosphate-buffered 
saline (PBS), or it may be used as a precipitate with calcium phosphate. 
Of course, other suitable buffers or carriers may usefully be employed. 
The expression of the said DNA in the tumour may be analysed by reverse 
transcriptase-PCR (that is, the messenger RNA expressed from the DNA 
10 in the tumour is isolated, converted into complementary DN A (cDN A) 
using the enzyme reverse transcriptase, and the resultant cDNA is 
amplified using the polymerase chain reaction and may be detected 
radiolabelling or staining), or by northern blot analysis or by KNase 
protection assays. 

15 

Such injection may be repeated at hourly, daily or weekly intervals. 

Uptake of naked DNA may depend on the three-dimensional growing mass 
of tumour so, although it is preferred that the tumour to be treated is 
- 20 melanoma, a prostate tumour, or a colon tumour or a pancreatic tumour, 
or a breast tumour, it may be any solid tumour. 

It is most preferred if substantially all cells in the tumour take up DNA 
and express the cytokine, but it is not essential for a useful clinical effect, 
25 as the antitumour effect of the cytokine is not limited to the tumour cell 
expressing the cytokine but will occur in non-transfected cells within the 
tumour and at secondary (metastatic) sites. Thus, if 5 % , preferably 25 % , 
more preferably 50% and most preferably substantially 100% of the 
tumour cells express the cytokine a clinically useful effect may be seen. 

30 



17 

It is desirable to express a plurality of cytokine coding sequences in a 
tumour cell, or to express a plurality of cytokine coding sequences in a 
tumour wherein each cytokine coding sequence is present in a separate 
DNA construct. It is preferable if the different cytokines, expressed by 
the plurality of coding sequences, stimulate different effector cells of the 
immune system. 

In one embodiment, each of the coding sequences of the plurality are 
directly joined to a means for expression in a tumour cell but are 
contained within the same DNA construct Thus, once the DNA is 
introduced into the tumour, every cell that takes up the DNA may express 
all of the cytokine coding sequences in the plurality. 

In a further embodiment, a plurality of DNA constructs is introduced into 
the tumour, each construct of the plurality comprises a means for 
expression of a coding sequence in a tumour cell and a coding sequence 
encoding a different cytokine. In this embodiment it is possible -to vary 
the proportion of cytokine coding sequences in the plurality. 

The components of the plurality comprise two or more of coding 
sequences encoding interleukin-2, interleukin-4, macrophage colony 
stimulating factor, interferon-?, tumour necrosis factor and interIeukin-7. 
The ratio of any two of the said coding sequences in the plurality may be, 
one to another, 100:1, 10:1 or 1:1. 

Thus, a particular plurality of coding sequences useful in the invention is 
interleukin-2:interIeukin-4:macrophage colony stimulating factor in a 
molar ratio of 1 : 1 : 1 . This particular combination of coding sequences will 
express a plurality of cytokines useful in attracting cytotoxic T cells, 
eosinophils and macrophages to the tumour, and to secondary (metastatic) 
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sites. All of these cell types have been shown to have anti-tumour 
activity. 

It is preferred that the means of expressing each coding sequence in the 
5 plurality is a tumour specific promoter. 

It is preferred that the plurality of DNA constructs is injected directly into 
the tumour. 

10 It is further preferred that the tumour into which the DNA construct is 
injected directly is a melanoma, breast cancer, prostate cancer or colon 
cancer. 

It is desirable to treat melanoma with a DNA construct wherein the means 
15 of expressing is the tyrosinase promoter. 

In a further embodiment it is preferred if the B-cell accessory molecule B7 
antigen is co-expressed with the cytokine in the tumour or tumour cell. 
B7 binds CD28 on T-cells and stimulates the activity of T-cells against 
20 tumours as is described in WO 92/00092. 

The cDNA encoding the B7 antigen molecule can be obtained using the 
method described by Freeman et al (1989) J. Immunol 143, 2714-2722 
incorporated herein by reference and the nucleotide and predicted amino 
25 acid sequence can be obtained therefrom. The nucleotide sequence of B7 
cDNA is given as SEQ ID No 23. 

The term "fragment** as used herein means a portion of the amino acid 
sequence corresponding to the B7 antigen. For example, a fragment of 
30 the B7 antigen useful in the method of the present invention is a 
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polypeptide containing a portion of the amino acid sequence corresponding 
to the extracellular portion of the B7 antigen, ie the DNA encloding amino 
acid residues from position 1 to 215 of the sequence corresponding to the 
B7 antigen described by Freeman et al, supra. 

5 

Complementary cDNA sequences encoding the amino add sequence 
corresponding to the B7 antigen or fragments or derivatives thereof can 
be syrithesised by the polymerase chain reaction (see US Patent No 
4,683,202) using primers derived from the published sequence of the 
10 antigen (Freeman et al, supra). These cDNA sequences can then be 
assembled into a vector so that the expression of the B7 antigen is driven 
by a means for expression in the tumour cell. 

It is preferred if the means for expression is a tumour-specific promoter. 

15 

It is further preferred if the promoter is the tyrosinase or TRP-1 promoter. 
It is preferred if the tumour is melanoma. 

20 The techniques for assembling and expressing DNA encoding the amino 
acid sequences corresponding to B7 antigen and the cytokines useful in the 
invention, eg synthesis of oligonucleotides, PCR, transforming cells, 
constructing vectors and the like are well-established in the art, and most 
practitioners are familiar with the standard resource materials for specific 

25 conditions and procedures. However, the following paragraphs are 
provided for convenience and notation of modifications where necessary, 
and may serve as a guideline. 

Complementary cDNA clones containing DNA encoding B7 proteins are 
30 obtained to provide DNA for assembling into the DNA constructs for use 
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in the methods of the invention. Alternatively, cDNA clones may be 
prepared from RNA obtained from cells expressing B7 antigen or the 
cytokines based on knowledge of the published sequences for these 
proteins using standard procedures. Published sequences for the cDNAs 
are given as SEQ ID Nos. 

The cDNA is amplified using the polymerase chain reaction ( U PCR W ) 
technique (see US Patent Nos. 4,683,195 and 4,683,202 to Mullis et al 
and Mullis & Faloona (1987) Methods EnzymoL 154, 335-350) using 
synthetic oligonucleotides encoding the sequences of the proteins as 
primers. PCR is then used to adapt the fragments for ligation to the DNA 
encoding the promoter fragments and to expression plasmid DNA to form 
cloning and expression plasmids. 

It is desirable to express a single cytokine coding sequence or a plurality 
of cytokine coding sequences in a tumour cell, in combination with the B7 
coding sequence, or to express a cytokine coding sequence in a tumour in 
combination with a B7 coding sequence wherein the cytokine coding 
sequence and the B7 coding sequence are present in a separate DNA 
construct. It is preferable if the different cytokines, expressed by the 
plurality of coding sequences, stimulate different effector cells of the 
immune system. 

In one embodiment, each of the coding sequences of the plurality of 
cytokines or B7 coding sequence are direcdy joined to a means for 
expression in a tumour cell but are contained within the same DNA 
construct. Thus, once the DNA is introduced into the tumour, every cell 
that takes up the DNA may express all of the cytokine coding sequences 
in the plurality and the B7 coding sequence. 
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In a further embodiment, a plurality of DNA constructs is introduced into 
the tumour, each construct of the plurality comprises a means for 
expression of a coding sequence in a tumour cell or a coding sequence 
encoding a different cytokine or B7 molecule. In this embodiment it is 
5 possible to vary the proportion of cytokine coding sequences and B7 
molecules introduced into the tumour. 

It will be appreciated by one skilled in the art that the same or different 
cytokine or B7 coding sequence may be expressed in the tumour cell from 

10 separate DNA constructs or that the said coding sequences* may be 
expressed in the tumour cell from the same DNA construct wherein each 
coding sequence has an independent means for expression or that the said 
coding sequences may be expressed in the tumour cell from the same 
DNA construct wherein each coding sequence has the same means for 

15 expression. In the latter case the coding sequences for a cytokine or a B7 
may be fused such that a fusion polypeptide is made; it is preferred if a 
linker joins the polypeptides in the fusion that is cleaved* in the 
environment of the tumour cell to release the active cytokine or B7. 

20 When melanoma is to be treated by the DNA constructs comprising a gene 
promoter from a melanin synthesis pathway gene such as tyrosinase, it is 
desirable if the patient to be treated is not black. 

It is further preferred if the patient to be so treated is fair-skinned. 

25 

In a further aspect of the invention the DNA constructs are used in 
conjunction with chemotherapy. Thus, the DNA construct, or a plurality 
of such constructs, may be administered at the same time as, preceding or 
after treatment with chemotherapeutic agents. 



30 
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Chemotherapeutic agents useful in this aspect of the invention include 
cisplatin, dacarbazine, tamoxifen, nitrosoureas including carnwsine 
(BCNU), vinca alkaloids, melphalan, doxorubicin, adriamycin, etoposide, 
5-fIuorouracil and other generally used agents. 

5 

These are listed in the table: 
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TABLE; CHKMOTHERAPEIITIC AGENTS 


Class 


Type of Agent 

* 


Nonproprietary 
Names (Other 
Names) 






Mechlorethaminc 
(HN2) 






Cyclophosphamide 
Ifosfamidc 


1 


XNitrogen rviustarus 


Melphalan 
(L-sarcolysin) 






Chlorambucil 


Alleviating A (rente 


Ethylenimines and 
Methylmelamines 


Hexamethylmela-mine 




Thiotepa 




Alkyl Sulfonates 


Busulfan 1 






Carmustine | 
fliCNTO U 






Lomustine (CCNU) | 




Nitrosoureas 


Semustine . II 
(methyl-CCNU) | 






Streptozocin 
(streptozotocin) 




Triazenes 


Decarbazine 
(DT1C; 
dimethyltria- 
zenoimi- 

dazolecarbox-amide) 
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Class 


Type of Agent 


Nonproprietary 1 
Names (Other | 
Names) | 






Folic Acid 
Analogs 


Methotrexate | 
(amethopterin) | 


5 


Antimetabolites 


Pyrimidine Analogs 


Fluorouracil 

(5-fluorouracil; 

5-FU) 

rioxunoine 

(fluorode- 

oxyuridine; FUdR) 








^yuiiaDinc. 

(cytosine 

arabinoside) 


10 






M ercaptopurine 
(6-mercaptopurine; 6- 
MP) 




Antimetabolites 
continued 


Purine Analogs 

and Related Inhibitors 


Thioguanine 
(6-thioguanine; TG) 






Pentostatin 
^'-deoxycoformycin) 
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Class 



10 



15 



Natural Products 



Type of Agent 



Vinca Alkaloids 



Epipodophyl- 
lotoxins 



Antibiotics 



Enzymes 



Biological 

Response 

Modifiers 



Nonproprietary 
Names (Other 
Names) 



Vinblastine 
(VLB) 



Vincristine 



Etoposide 
Teniposide 



Dactinomycin 
(actinomydn D) 



Daunorubicin 
(daunomycin; 
rubidomycin) 



Doxorubicin 



Bleomycin 



Plicamycin 
(mithramycin) 



Mitomycin # 
(mitomycin Q 



L- Asparaginase 



Interferon alfa 



20 



Miscellaneous 
Agents 



Platinum 
Coordination 
Complexes 



Methyl Hydrazine 
Derivative 



Cisplatin (cw-DDP) 
Caiboplatin 



1 Anthracenedione 


1 Mitoxantrone 


| Substituted Urea 


Hydroxyurea 



Procarbazine 

(N-methylhydrazine, 

MIH) 
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It is preferred if the DNA construct or the plurality of constructs expresses 
interleukin-2 which will facilitate the substantial destruction of the 
vasculature and promote the action of the chemotherapeutic agent 

5 Further aspects of the invention provide a composition comprising a 
construct of the invention and means for selectively delivering it to a 
tumour and a method of treating a tumour and/or ameliorating metastasis 
therefrom comprising delivering into cells of the tumour a construct of the 
invention. 

10 

The invention will now be described with reference to the following 
Examples and Figures wherein: 

Figure 1 shows the tissue specific expression cassettes using the tyrosinase 
15 and the TRP-1 gene promoters; 

Figure 2 shows the relative activity of tyrosinase and TRP-1 promoters in 
murine B16.F1 melanoma and NIH 3T3 cells; 

20 Figure 3 shows the retroviral vector pBabe Puro (Tyr-jS-Gal). 

Figure 4 shows the c-erbB-2/CAT construct of Example 5. 

Figure 5 shows the result of a comparison of activity of the construct of 
25 Example 5 in two cell lines: T47D, which is a breast carcinoma cell line 
with base line c-erbB-2 expression, and ZR75-1, which is a breast 
carcinoma cell line with elevated c-erbB-2 expression. 



30 



SEQ ID No 1 shows the nucleotide sequence of the CEA gene including 
the promoter region. 
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SEQ ID No 2 shows the sequence of the PSA gene including the promoter 
region. 

Figure 6 shows the 5' flanking sequence with 71 bp of transcribed 
5 sequence of the human MUC1 gene (SEQ ID No 3). The TATA box 
(boxed) and transcriptional start site (+1) are indicated. The sequence (- 
787 to +71) covers the region required for maximum transcription of the 
reporter gene (-743 to +33). 

10 Figure 7 shows the DNA sequence of the human c-eriB-2 5' region as 
determined by Hudson et al (1990) J. Biol Chenu 265, 4389-4393 (SEQ 
ID No 4). 

Figure 8 shows the DNA sequence of the human c-erbB-3 5' region (SEQ 
15 ID No 5) and the predicted amino acid sequence of the first exon (SEQ ID 
No 6). 

SEQ ID No 7 shows the DNA sequence of the tyrosinase promoter. 
20 SEQ ID No 8 shows the DNA sequence of the TRP-1 promoter. 

SEQ ID No 9 shows the DNA gene sequence encoding interleukin-2 QL, 
2); the cDNA sequence is readily derived from the positions of the exons. 

25 SEQ ID No 10 shows the cDNA sequence encoding interleukin-4 (IL-4). 

SEQ ID No 11 shows the cDNA sequence encoding interleukiti-7 (IL-7). 

SEQ ID No 12 shows the cDNA sequence encoding tumour necrosis 
30 factor (TNF). 
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SEQ ID No 21 shows the cDNA sequence encoding interferon-gamma 
(IFN-7). 

SEQ ID No 22 shows the cDNA sequence encoding human granulocyte 
5 macrophage colony stimulating factor GM-CSF. 

SEQ ID No 23 shows the B7 cDNA sequence. 

The following information is useful to the person skilled in the art to 
10 identify coding regions and promoter sequences for use in the invention. 
Journal references and EMBL database accession numbers are given. 

SEQ ID No 1 

15 ID HSCEA01 standard; DNA; PRI; 3281 BP; AC M59255; M31966; DE 
Human carcinoembryonic antigen (CEA) gene, complete cds; KW 
carcinoembryonic antigen; OS Homo sapiens (human); OC Eukaryota; 
Animalia; Metazoa; Chbrdata; Vertebrata; Mammalia; OC Theria; 
Eutheria; Primates; Haplorhini; Catarrhini; Hominidae; RN [1]; RP 

20 1-3281; RA. Schrewe H., Thompson J., Bona M., Hefta L.J., Maruya 
A.,; RA Hassauer M., Shively J.E., von Kleist S., Zimmermann W; RT 
"Cloning of the complete gene for carcinoembryonic antigen:; RT 
Analysis of its promoter indicates a region conveying cell; RT 
type-specific expression"; RLMol. Cell. Biol. 10:2738-2748(1990); FH 

25 Key Location/Qualifiers; FH; FT sigpeptide 
join(1769..1832,2725..2762); FT /gene="CEA"; FT exon 1659..1832; 
FT /number =1 /gene="CEA" /codon_start= 1659; FT exon 2725..3084; 
FT /number=2 /gene = "CEA" /codon_start=2725; SQ Sequence 3281 
BP; 847 A; 953 C; 871 G; 610 T; 0 other; CC 

30 
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SEQ ID No 21 

ID HSIFNGAMM standard; RNA; PRI; 1011 BP; AC M26683; DT 
23-NOV-1989 (Rel. 21, Created); DT 26-MAY-1992 (Rel. 32, Last 
5 updated, Version 5); DE Human interferon gamma (IFN-gamma) mRNA, 
complete cds; KW interferon gamma; type II; OS Homo sapiens (human); 
OC Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; OC 
Theria; Eutheria; Primates; Haplorfaini; Catarrhini; Hominidae; RN. [1]; 
RP 1-1011; RA Fan X., Stark G.R., Bloom B.R; RT "molecular cloning 

10 of a gene selectively induced by gamma; RT interferon from human 
macrophage cell line u937"; RLMol. Cell. Biol. 9:1922-1928(1989); FH 
Key Location/Qualifiers; FH; FT CDS 15. .131; FT /product = "interferon 
gamma" /gene = "IFN-gamma"; FT /codon_start=l; FT polyA_signal 
971. .976; FT /gene = "IFN-gamma"; SQ Sequence 1011 BP; 301 A; 236 

15 C; 184 G; 290 T; 0 other; 

SEQ ID No 2 

ID HSPSAA standard; DNA; PRI; 7130 BP; AC M27274; DT 
20 23-APR-1990 (Rel. 23, Last updated, Version 1); DT 02-FEB-1990 (Rel. 
22, Created); DE Human prostate-specific antigen gene, complete cds; 
KW Alu repetitive element; kallikrein; prostate specific antigen; OS Homo 
sapiens (human); OC Eukaryota; Animalia; Metazoa; Chordata; 
Vertebrata; Mammalia; OC Theria; Eutheria; Primates; Haplorhini; 
25 Catarrhini; Hominidae; RN [1]; RP 1-7130; RA Lundwall A; RT. 
"Characterization of the gene for prostate-specific antigen, a; RT human 
glandular kallikrein"; RL Biochem. Biophys. Res. Commun. 
161:1151-1159(1989); DR SWISS-PROT; P07288; PROS$HUMAN; FH 
Key Location/Qualifiers; FH; FT CDS 675.. 720; FT 
30 /note = "prostate-specific antigen, exon 1; FT /nomgen="APS" 
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/map="19ql3.3-qter"; FT /hgml_locus_uid = "LN0098S" " ; FT intron 
721.. 1958; FT /note="PSA intron A"; FT CDS 1959..2118; FT 
/note = "prostate-specific antigen, exon 2"; FT intron 21 19. .3755; FT 
/note="PSA intron B"; FT repeat_region 2583..2935; FT /note="Alu 
5 repeat"; FT CDS 3756. .4042; FT /note= "prostate-specific antigen, exon 
3"; FT intron 4043..4185; FT /note="PSA intron C"; FT CDS 
41 86. .4322; FT /note = "prostate-specific antigen, exon 4"; FT intron 
4323..5698; FT /note="PSA intron D"; FT CDS 5699. 5854; FT 
/note = "prostate-specific antigen, exon 5"; SQ Sequence 7130 BP; 1530 
10 A; 2024 C; 1867 G; 1709 T; 0 other; . 

SEQ ID No 8 

ID MMTRP15 standard; DNA; ROD; 1236 BP; AC X59513; DT 
15 26-JUL-1991 (Rel. 28, Created); DT 26-JUL-1991 (Rel. 28, Last updated, 
Version 2); DE Mouse 5* end of TRP1 gene for tyrosinase-related 
protein- 1; KW TRP1 gene; tyrosinase; tyrosinase-related protein- 1; OS 
Mus musculus (mouse); OC Eukaryota; Animalia; Metazoa; Chordata; 
Vertebrata; Mammalia; OC Theria; Eutheria; Rodentia; Myomorpha; 
20 Muridae; Murinae; RN. [1]; RA Jaekson I.J., Chambers D.M., Budd 
P.S., Johnson R; "The tyrosinase-related protein- 1 gene has a structure 
and promoter sequence very different from tyrosinase. ";Nucleic Acids 
Res. 19:3799-3804(1991) SQ Sequence 1236 BP; 357 A; 234 C; 282 G; 
363 T; 0 other; , 

25 

SEQ ID No 22 

ID HSCSFGMA standard; DNA; PRI; 3194 BP; AC M13207; DT 
07-JUN-1987 (Rel. 12, Created); DT 24-DEC-1990 (Rel. 26, Last 
30 updated, Version 2); DE Human granulocyte-macrophage 
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colony-stimulating factor (hGM-CSF); DE gene, complete cds; KW 
granulocyte-macrophage colony stimulating factor; OS Homo sapiens 
(human); OC Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; 
Mammalia; OC Theria; Eutheria; Primates; Haplorhini; Catarrhini; 
5 Hominidae; RN [1]; RP 1-3194; RA Kaushansky K., O'Hara PJ., 
Berkner K., Segal G.M., Hagen F.S.,; RA Adamson J.W; RT "Genomic 
cloning, characterization, and multilineage; RT growth-promoting activity 
of human granulocyte-macrophage; RT colony-stimulating factor"; RL 
Proc. Natl. Acad. Sci. U.S.A. 83:3101-3105(1986); RN [2]; RP 1-3194; 
10 RA Kaushansky K; RT; RL Unpublished; DR CPGISLE; HSCSFGMA; 
Release pre-1.0; DR SWISS-PROT; P04141; CSF2JHUMAN; SQ 
Sequence 3194 BP; 700 A; 859 C; 945 G; 690 T; 0 other; CC 

SEQ ID No 9 

15 

ID HSIL21 standard; DNA; PRI; 5737 BP; AC J00264; DT 29-JUL-1991 
(Rel. 28, Created); DT 29-JUL-1991 (Rel. 28, Last updated, Version 1); 
DE Human interleukin 2 (11-2) gene, complete coding sequence; KW 
immune response gene; interleukin; interleukin 2; lymphokine; KW T-cell; 

20 T-cell growth factor; OS Homo sapiens (human); OC Eukaryota; 
Animalia; Metazoa; Chordata; Vertebrata; Mammalia; OC Theria; 
Eutheria; Primates; Haplorhini; Catarrhini; Hominidae; RN [1]; RP 
431-624, 715-774, 3068-3211, 5057-5443; RA Maeda S., Nishino N., 
Obaru K., Mita S., Nomiyama H., Shimada K.,; RA Fujimoto K., 

25 Teranishi T., Hirano T., Onoue K; RT "Cloning of interleukin 2 mRNAs 
from human tonsils"; RL Biochem. Biophys. Res. Commun. 
1 15:1040-1047(1983); RN CC Key Location/Qualifiers; FH; FT CDS join 
(478..624,715..774,3068..3211,5057..5167); SQ Sequence 5737 BP; 1995 
A; 932 C; 922 G; 1888 T; 0 other; CC; ID HSIL4 standard; RNA; PRI; 

30 614 BP; AC M13982; DT 07-JUN-1987 (Rel. 12, Created); DT 
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03-SEP-1992 (Rel. 33, Last updated, Version 2); 
SEQIDNolO 

5 KW interleuldn; OS Homo sapiens (human); OC Eukaryota; Animalia; 
Metazoa; Chordata; Vertebrata; Mammalia; OC Theria; Eutheria; 
Primates; Haplorhini; Catarrhini; Hominidae; RN [1]; RP 1-614; RA 
Yokota T., Otsuka T., Mosmann T., Banchereau J., DeFrance T.,; RA 
Blancfaard D., De Vries J.E., Lee F., And K.i. "Isolation and 

10 characterization of a human interleuldn cDNA clone homologous to mouse 
B-cell stimulatory factor 1, that expresses B-cell- and T-cell-stimulating 
activities"Proc. Natl. Acad. Sci. U.S.A. 83:5894-5898(1986). ; DR 
SWISS-PROT; P05112; IL4_HUMAN; FH Key Location/Qualifiers; FH; 
FT mRNA <1..614; FT /note="IL-4 mRNA"; FT CDS 64.. 524; FT 

15 /note= "interleuldn 4" /gene="EL4" /partial; FT sig_peptide 64.. 135; FT 
/note— "interleuldn 4 signal peptide"; FT mat_peptide 136..522; FT 
/note = "interleuldn 4 mature peptide"; SQ Sequence 614 BP; 174. A; 150 
C; 129 G; 161 T; 0 other; 

20 SEQ ID No 11 

ID HSEL7A standard; RNA; PRI; 1589 BP; AC J04156; DT 
22-APR-1989 (Rel. 19, Created); DT 06-JUL-1989 (Rel. 20, Last 
updated, Version 1); DE Human interleuldn 7 (IL-7) mRNA, complete 

25 cds; KW interleuldn; interleuldn 7; OS Homo sapiens (human); OC 
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; OC 
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae; RN [1]; 
RP 1-1589; RA Goodwin R.G., Lupton S., Schmierer A., Hjerrild K.J., 
Jerzy R.,; RA Clevenger W., Gillis S., Cosman D., Namen A.E; RT 

30 "Human interleukin 7: Molecular cloning and growth factor activity; RT 
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on human and murine B-lineage cells"; RL Proc. Natl. Acad. Sci. U.S.A. 
86:302-306(1989); DR SWISS-PROT; P13232; IL7HUMAN; CC Draft 
entry and computer-readable sequence [1] kindly submitted by; CC 
R.Goodwin, 05-JAN-1989; FH Key Location/Qualifiers; FH; FT mRNA 
5 <1..1589; FT /note="interieukin 7 mRNA"; FT CDS 385..918; FT 
/note="interleukin 7 precursor"; FT CDS 385.. 459; FT 
/note="interleukin 7 signal peptide"; FT CDS 460..915; FT 
/note="interleukin 7"; SQ Sequence 1589 BP; 532 A; 284 C; 339 G; 434 
T; 0 other, 

10 

SEQ ID No 12 

Human tumour necrosis factor mRNA; ID HSTNFAA standard; RNA; 
PRI; 1585 BP; AC M10988; DT 16-JUL-1988 (Ret 16, Created); DT 

15 02-SEP-1992 (Rel. 33, Last updated, Version 2); DE Human tumor 
necrosis factor (TNF) mRNA; KW ; OS Homo sapiens (human); OC 
Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; OC 
Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae; RN [1]; 
RP 1-1585; RA Wang A.M., Creasey A. A., Ladner M.B., Lin L.S., 

20 Stickler J.,; RA Van Arsdell J.N., Yamamoto R., Mark D.F; RT 
"Molecular cloning of the complementary DNA for human tumor; RT 
necrosis factor"; RL Science 228:149-154(1985); DR SWISS-PROT; 
P01375; TNFA_HUMAN; FH Key Location/Qualifiers; FH; FT CDS 
86..787; FT /note = "tumor necrosis factor" /gene="TNFA"; FT 

25 /codon_start= 1; SQ Sequence 1585 BP; 352 A; 473 C; 389 G; 371 T; 0 
other; CC 

SEQ ID No 7 

30 ID MMTYR1 standard; DNA; ROD; 4758 BP; AC D00439; DT 
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14-FEB-1991 (Rel. 27, Created); DT 14-FEB-1991 (Rel. 27, Last 
updated, Version 1); DE Mouse tyrosinase gene, 5'flankand exon 1; KW 
melanin; melanocyte; monooxygenase; tyrosinase; OS Mus musculus 
(mouse); OC Eukaryota; Animalia; Metazba; Chordata; Vertebrata; 
5 Mammalia; OC Theria; Eutheria; Rodentia; Myomorpha; Muridae; 
Murinae; RN [1]; RP 2481-3363; RA ; RN [2]; RP 1-4758; RA 
Yamamoto H., Takeuchi S., Kudo T., Sato C, TakeuchiT; RT "Melanin 
production in cultured albino melanocytes transfected; RT with mouse 
tyrosinase cDNA"; RL Jpn. ■ J. Genet 64:121-135(1989); FH Key 

10 Location/Qualifiers; FH; FT miscsignal 2004. .2008; FT /note=v"putative 
CAT box"; FT misc_signal 2128..2133; FT /note = "putative CAT box"; 
FT misc_signal 2140..2146; FT /note = "putative TATA box"; FT 
misc_signal 2264.. 2268; FT /note = "putative CAT box"; FT misc_signal 
2272..2279; FT '/note = "putative TATA box''; FT misc_signal 

15 2286..2289; FT /note= "putative CAT box"; FT misc_signal 2434^2440; 
FT /note - "putative TATA box"; FT misc_feature 2465.. 2466; FT 
/note =" CAP sites"; FT CDS 2545.. > 3363; FT /note = "tyrosinase gene, 
exon 1" /partial; SQ Sequence 4758 BP; 1550 A; 859 C; 878 G; 1465 T; 
6 other; CC 

20 

SEQ ID No 23 

; ID HSIGB7 standard; RNA; PRI; 1491 BP. ; AC M27533; ; DT 

23-APR-1990 (Rel. 23, Created) ; DT 23-APR-1990 (Rel. 23, Last 
25 updated, Version 1) ; DE Human Ig rearranged B7 protein mRNA 

VCl-region, complete cds. ; KW constant region; immunoglobulin; 

variable region. ; OS Homo sapiens (human) ; OC Eukaryota; Animalia; 

Metazoa; Chordata; Vertebrata; Mammalia; ; OC Theria; Eutheria; 

Primates; Haplorhini; Catarrhini; Hominidae. ; RN [1] ; RP 1-1491 ; RA 
30 Freeman G.J., Freedman A.S., Segil J.M., Lee G., Whitman J.F., ; RA 
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Nadler L.M.; ; RT "B7, a new member of the Ig superfamily with unique 
expression on ; RT activated and neoplastic B cells"; ; RL J. Immunol. 
143:2714-2722(1989). ; CC Draft entry and computer readable copy of 
sequence [1] kindly ; CC provided by GJ. Freeman, 08-SEP-1989. ; FH 
5 Key Location/Qualifiers ; FH ; FT CDS 318. .1184 ; FT 
/note= "transmembrane protein Bl precursor" ; FT CDS 318..395 ; FT 
/note = "transmembrane protein Bl signal ; FT peptide" ; FT CDS 
396. .1181 ; FT /note= "transmembrane protein Bl" ; SQ Sequence 1491 
BP; 419 A; 343 C; 31 1 G; 418 T; 0 other; ; CC 

10 

* 

Example Is Demonstration of t issue specificity of 5' sequences nf 
murine tyrosinase and TRP-1 genes. 

A 2.5kb fragment from the 5' end of the tyrosinase gene was generated 

15 by PCR from genomic DNA of the B16 melanoma line. The 
oligonucleotides used (Pair 1: 5'-CGGAATTTCATGCCCCAGTTGAC- 
AACATAG-3', SEQ ID No 13; 5 , -CACTCGAGAAC ri l l T CTCCT- 
TTAGATCATACAA-3 ', SEQ ID No 14) were derived from the murine 
sequence published by Yamamoto et al (1989) Jpn. J. Genet. 64, 121-135. 

20 Shorter 5' sequences were generated also using oligonucleotides matched 
from the Yamamoto paper ( P a i r • 2 : 
5'CGGGAATTCATGCCCCAGTTGACAACATAG-3\ SEQ ID No 15; 
5'-GAGCTCGAGTGTCACAGACTTCTTTTCCA-3,SEQIDNol6;Pair 
3: 5'-AAACGAATTCCATCCAGTAAGTCCATTACT-3', SEQ ID No 

25 17; 5'-GAGCTCGAGTGTCACAGACTTCTTTC-3\ SEQ ID No 18). 
The 769bp fragment of the tyrosinase gene extends from position -815 to 
position -46 in the promoter. A 4.0kb fragment of 5' sequence of the 
TRP-1 gene was provided by Dr I J. Jackson, MRC Genetics Unit, 
Edinburgh and from this a 1.4kb fragment was derived by Xbal-Sall 

30 digestion. The promoter sequence at the 5' of TRP-1 gene may be 
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obtained following the methods described in Jackson et al (1991) NucL 
Acids Res. 19, 3799-3804. 

These 5' sequences, and the SV40 promoter as a control, were inserted 
5 upstream of the 0-galactosidase gene in the vector pNASS (obtained from 
Clontech Ltd) as indicated in Figure 1. 



Figure 1 shows (A) pNASS/3, a promoterless mammalian expression 
vector described by MacGregor & Caskey (1989) NucL Acids Res. 17, 

10 2365. Three unique restriction sites allow cloning of promoter sequences 
upstream of an expression cassette containing the SV40 splice 
donor/acceptor sequence (sd/sa), the 0-galactosidase gene and the SV40 
polyadenylation sequence. SV40 /S-Gal contains the SV40 early viral 
promoter (from the pBabe Puro vector, as described by Morgenstern & 

15 Land (1990) NucL Acids Res. 18, 3596, cloned into pNASSjS. (B) 2496 
bp <Tyr-jS-Gal 1) or 769 bp (Tyr-jS-CJal 2) fragments of the mouse 
tyrosinase promoter (Yamamoto et al (1989) Jap. J. Genet. 64, 121-135) 
were generated by PCR from genomic DNA of the B16.F1 melanoma cell 
line and cloned into the EcoRI and Xhol restriction sites of pNASSjS. (C) 

20 The plasmids TRP-l-0-Gal 1 and 2 were a gift from I. Jackson and 
contain 4 kbp and 1.4 kbp of the TRP-l promoter (Jackson et al (1991) 
NucL Acids Res. 19, 3798-3804) upstream of the j8-galactosidase gene and 
the SV40 polyadenylation sequence. The different constructs were 
transfected into a variety of murine and human cells of melanocytic and 

25 noiwnelanocytic origin, including B16 melanoma cells or NIH 3T3 
fibroblasts and subsequent 0-galactosidase activity was measured 72-96 
hours after transfection both by fluorometric assay, using 4- 
methylumbelliferyl-j3-D-galactoside (MUG) as substrate, and by 
histochemical analysis using X-gal as substrate. By both assays the 

30 various tyrosinase and TRP-1 promoter containing 5' sequences were 
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shown to drive /3-galactosidase activity in a murine melanocyte (Mel-ab) 
line and the B16 melanoma and the human melanoma lines SK23, HMB-2, 
Mel 8, TXM13> T8 and SS3. No activity was observed in the murine 3T3 
or L cell lines or the human HeLa, LS174T, HT29, HOS, SW620 and 
5 HUVEC lines, none of which are of melanocytic origin (see Figure 2 and 
Table 1). 

Figure 2 shows the relative activity of tyrosinase and TRP-l promoters in 
murine B16.F1 melanoma and NIH 3T3 cells. Cells were transfected with 

10 10 ng of the appropriate plasmid DNA using the calcium phosphate 
method. 72-96 hours after the calcium phosphate precipitate had been 
washed away the cells were analysed for expression of /3-galactosidase 
using the quantitative MUG assay. Data are expressed as mean of 
triplicate values ± SD. The data presented are representative of four 

15 similar experiments. 

In contrast, the SV40 promoter was able to direct expression of the 
reporter gene to high levels in both cell types. 

20 
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galactosidase from Tyrosinase and TRIM promoters. Each cell line 
10 indicated was transfected with 10 fig of plasmid DNA of Tyr-0-Gal 1 and 
2, TRP-l-0-Gal 1 and 2. pNASS-0 and SV4O-0-Gal were used in each 
case as a negative and positive control for transfection. Expression of 0- 
galactosidase was scored as positive (+) if several cells stained blue 96 
hours after transfection; a cell line was scored as negative (-) if no blue 
IS cells were observed after transfection and if the quantitative MUG assay 
showed no expression above background levels (transfection with pNASS- 
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These results confirm and extend the reports of other groups showing 
excellent tissue specificity of gene expression in melanocyte cells of either 
5 murine or human origin when the 5' promoter regions of either the 
tyrosinase or TRP-1 gene are utilised. 

Example 2: Ma terials and methods pertaining to the other Examples. 

10 Construction of Expression Plasmids and Retroviral Vectors. 
Subcloning was carried out via standard recombination DNA techniques 
(Sambrook et al (1989) Molecular cloning, a laboratory manual, Cold 
Spring Harbor Laboratory Pressi, NY, USA). Restriction endonuclease 
enzymes were supplied by Northumbria Biologicals (NBL, Cramlington, 

15 UK) and Taq polymerase was supplied by Stratech (Luton, UK). 
Oligonucleotides, synthesised on an Applied Biosystems 380B and purified 
by denaturing acrylamide electrophoresis, were provided by the 
Oligonucleotide Synthesis Laboratory, ICRF Clare Hall, South Mimms, 
UK. Polymerase chain reaction (PCR) amplification of DNA fragments 

20 was carried out on a Techne PHC-2 Thermocycler and reaction mixes 
were prepared in a hood separate from normal areas of DNA handling. 
Amplified DNA sequences were subcloned into the PCR II vector 
(Invitrogen; British Biotechnology Products Ltd, Oxford, UK) and their 
identities were confirmed by restriction endonuclease mapping. The 
25 correct fragments were then shuttled from PCR II into the appropriate 
expression plasmid. 

Cell Culture. All cell lines used in this study were checked routinely and 
found to be free of mycoplasma infection. Apart from Melab cells which 
30 were cultured in medium supplemented as described previously (Burrows 
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et al (1991) Cancer Res. 51, 4768-4775) the lines were grown in Eaglet 
minimal essential medium supplemented with 10% (v/v) fetal calf serum 
and 4 mM L-glutamine. HUVEC (Human umbilical vein endothelial 
cells) were maintained in Medium 199 (Gibco-Biocult Ltd, Paisley, 
5 Scotland) supplemented with Earie's salts, 20% (v/v) fetal calf serum, 
endothelial cell growth supplement (0.12 mg/ml) 0.09 mg.ml heparin and 
glutamine. Cultures were maintained at 37°C in a humidified atmosphere 
of 90% air/10% C0 2 . 

10 DNA Transfection. 10 6 adherent cells were transfected with 10 fig of 
plasmid DNA by calcium phosphate co-precipitation using the Profection 
method (Promega, Madison, WI) according to the manufacturer's 
instructions. 24 hours after the application of the precipitate to the tissue 
culture medium, cells were washed three times in serum-free medium and 

15 incubated in normal medium for 72-96 hours when they were stained for 
0-galactosidase expression. 

Intra-Tumoral Injection of DNA. 1-1 .5 x 10 5 tumour cells of either the 
B16 Fl murine melanoma or the Colo 26 colon carcinoma were injected 

20 subcutaneously in 100 fd inoculum volumes into the flank region of 
syngeneic mice (C57 for B16 Fl, Balb/C for Colo 26). Ten days later the 
animals were anaesthetised by halothane inhalation (ICI Pharmaceuticals, 
Macclesfield, UK), the tumours, approximately 4 mm in diameter, were 
located by palpation and injected with 1 fig DNA in 100 /tl volumes of 

25 either PBS or as calcium phosphate precipitates via a 27-gauge needle. 

Quantitative Assay for 0-Galactosidase Expression. Transfected cells 
were assayed for enzyme activity by the technique of MacGregor et al 
(1991) Methods in Molecular Biology 7, 217-235 (Ed., EJ. Murray) 
30 Humana Press Inc, Clifton, NJ, USA. Briefly cells were resuspended in 



WO 94/04196 



PCT/GB93/01730 



41 

Z buffer (60 mM Na2P0 4 .7H 2 0, 40 mM NaH 2 P0 4 .H 2 O t 10 mM KC1, 1 
mM MgS0 4 .7H 2 0) at 10 7 cells per ml. 105 fd of this cell suspension 
were dispensed per well of a microtiter plate and 15 fd of 1 % Triton X- 
100 were added to each well to give a final concentration of 0.1 %. After 
5 10 minutes at room temperature, 30 fd of 3 mM methylumbelIiferyl-0-D- 
galactoside (MUG) (Sigma, Poole, UK) in Z buffer were added to each 
well and the reaction was allowed to proceed for 90 minutes at 37°C. 75 
fd of 300 mM glycine, 15 mM EDTA, pH 11.2 were added to stop the 
reaction. Fluorescence was measured on a microtiter dish fluorescence 
10 reader (excitation at 350 nm and emission read at 450 nm). 

Cells expressing 0-galactosidase convert the MUG substrate, a non- 
fluorescent galactoside analogue, to the fluorescent molecule 4- 
methylumbelliferone. 

15 

Histochemical Detection of jS-Galactosidase-expressing Cells. 72-96 hours 
following DNA transfection, adherent cells were washed once in 
phosphate buffered saline (PBS) and fixed for 10 minutes at 4°C with 
3.8% formaldehyde in PBS. The fixative was removed by three washes 

20 with PBS and the cells were then incubated with X-gal solution [5-bromo- 
4-cMoro-3-indoyl~/?-galactopyranoside (Sigma) at 40 mg/ml in 
dimethylformamide was diluted to 1 mg/ml in 5 mM K 3 Fe(CN) 6 ; 5 mM 
K4Fe(CN) 6 .3H 2 0; 2 mM MgCl 2 ; 0.01% sodium deoxycholate; 0.2% 
NP40. All solutions were prepared using glass] at 37°C for at least 4 

25 hours according to published techniques (Bondi et al (1982) Histochem. 
76, 153-158). After staining the X-gal solution was removed, the cells 
were washed three times in PBS and the cells were inspected under a light 
microscope. Cells expressing the j3-galactosidase gene hydrolyse the 
chromogenic substrate X-gal to give the blue dye bromochloroindole. 

30 Control untransfected cells also were stained to assess the background 
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endogenous 0-gal staining. 

Detecting of /3-GaIactosidase-expressing Tumour Cells. 2, 4, 6 or 10 
days after injection of DN A into the tumours, animals were killed by C0 2 
5 inhalation, their tumours were excised, minced to 1 mm cubes with 
scalpels and pushed through a stainless steel sieve with a 5 ml syringe 
plunger, into culture medium. An aliquot of the resulting cell suspension 
was spun onto a glass microscope slide using a cytospin centrifuge. Slides 
were air-dried then fixed for 5 minutes in 3.8% formaldehyde in PBS. 
10 The cells were rinsed in PBS and incubated overnight in X-gal stain before 
being inspected under a light microscope for the presence of blue cells. 

Generation of Recombinant Retrovirus Stocks. The AM 12 packaging 
cell line (Markowitz et al (1988) Virol 167, 400-406) containing the 

15 packaging constructs for Moloney Leukaemia Virus was transfected with 
10 fig of retroviral plasmid DNA using the calcium phosphate co- 
precipitation method. 48 hours following transfection the cells were split 
into puromycin (Sigma) selection medium (1 ftg/ml) and surviving 
colonies were selected and pooled two weeks later. Virus was harvested 

20 from these producer cells by exposing fresh medium to 5 x 10 6 cells on 
a 90 mm plate and harvesting the medium 16 hours later. The medium 
was filtered through a 0.45 /xm filter (Nalge (UK) Ltd, Rotherwas, 
England) to remove cell debris and was then used to infect target cells. 
The target cells were split 24 hours earlier to a density of 10 5 cells per 90 

25 mm plate. Polybrene (Aldrich, Gillingham, Dorset) was added to the viral 
supernatant to 4 /xg/ml to enhance virus-cell surface interactions and the 
target cells were exposed to 1 ml of viral supernatant for 2.5 hours at 
37°C. 8 ml of normal growth medium were added to the plate and the 
infected cells were grown for a further 72-96 hours before being stained 

30 for expression of 0-galactosidase. 
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Example 3: Preparation of tyrosi nase promoter- or TRP-1 promoter- 
driven exp ression vectors containing cytokine cDNA'p . 

The pBCMGNeo-mIL-2 vector was provided by Dr P. Frost, University 
5 of Texas, Houston and is described in Eur. /. Immunol 18, 97-194 
(1988), although other vectors are suitable. This vector had been used to 
transfect B16 melanoma cells (a non-cell-type-specific approach) and IL-2 
producing cells had been selected (Fearon et at (1990) Cell 60, 397-403). 
The HCMV promoter of this vector was removed by XbaUSall digestion 

10 and replaced with the L4kb XbaUSall fragment of TRP-1 5' sequences 
or the 780bp tyrosinase 5' sequence fragment generated by Pair 3 
oligonucleotides. These constructs were transfected into murine B16 
melanoma cells or 3T3 fibroblasts. For the TRP-1 - EL2 construct a total 
of 60 puromycin-resistant clones were isolated and screened by ELISA for 

15 IL-2 production (Genzyme Ltd). Clones were characterised as high (2: 
960 pg/ml), intermediate (150-960 pg/ml) and low (^ 150 pg/ml) 
expressers. Of the 60 clones, 13 clones were found to be producing and 
secreting measurable quantities of EL-2 while ten clones of 3T3 cells and 
four pooled bulk populations of 3T3 did not contain any cells expressing 
20 detectable amounts of IL-2 activity. These results show that the tissue- 
specific promoter, TRP-1, is able to drive expression of a cytokine cDNA 
in an appropriate cell type. Repeated analysis over a 6-8 week culture 
period showed that the observed phenotype is stable. 

25 Alternatively, the EL-2 coding sequence can be incorporated into a 
tyrosinase promoter vector as follows: 

The murine IL-2 cDNA is PCR amplified from pBCMGNeo mXLrl using 
the primers GCGGCCGCGCATGTACAGCATGCAGCTCGCA (SEQ ID 
30 No 19) and GCGGCCGCTAAATAAATAGAGAGCCTTATG (SEQ ID 
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No 20). 

The PCR fragment is cloned into the vector PCRII (available from 
Invitrogen) and then excised from the PCRII vector using Noil digestion. 
5 The Notl fragment is cloned into the Notl site of Tyr-j8-Gal-l (described 
in Example 1) in place of the 0-gaIactosidase gene. This produces Tyr IL~ 
2 with a 2494 bp promoter from the tyrosinase gene driving expression of 
11^2. 

10 B16 clones have been injected into groups of syngeneic CS7 mice.. To 
date only the cell clone selected for drug resistance, ie lacking IL-2 
expression, is forming progressively growing tumours in these animals. 
The IL-2 secreting B16 cells are not forming palpable tumours and, if they 
do develop, are clearly growing at a slower rate in vivo. 

15 

In addition to the cells secreting IL-2, IL-2 expression is assessed using 
RT-PCR wherein RNA is isolated, primers such as oligo dT used to prime 
synthesis of cDNA from the mRNA using reverse transcriptase and the 
level of ILr2 RNA estimated by amplifying with IL-2-specific 
20 oligonucleotides. 

We have placed cDNA for DL-4 (bought from British Biotechnology Ltd) 
downstream of both promoter sequence but the construct may utilise any 
cytokine gene (eg GM-CSF, TNF, DFN), be combined with the HSV tk 

25 gene for ganciclovir selection, or may utilise cDNAs encoding for genes 
which might stimulate the immune response (eg MHC antigens, MAGE 
(melanoma antigens) etc). This procedure allows targeted expression of 
the requisite gene to the cell type of interest, ie melanocyte-derived cells. 
Replacement of the tyrosinase or TRP-1 promoter sequences with 

30 sequences which are expressed by other tumour types in a specific fashion 
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(eg 5' promoter sequences of the CEA gene for colorectal tumours, 5' 
sequences of prostate secreted antigen for prostatic tumours) permits 
targeted expression of similar genes to other tumour types. 

5 Example 4: Introducti on of tissue specific promoter-driven penes into 
target cells in vivo. 

There are two main routes of delivery:- 

10 1) Retroviral delivery 

2) Direct delivery 

Incorporation into a Retroviral Vector. The ability of the melanocyte- 
specific promoters to function after delivery via a retroviral vector was 

15 examined because retroviral-mediated gene delivery is a promising route 
for delivery of gene therapy in vivo (Miller (1992) Nature 357, 455-460). 
The retroviral vector pBabe Tyr-jS-Gal was constructed from the pBabe 
Puro vector (Morgenstern & Land (1990) NucL Acids Res. 18, 3587-3596) 
(Figure 3). Here 0-galactosidase is expressed from the 769 bp tyrosinase 

20 promoter fragment of Tyr-0-Gal 2 inserted into pBabe Puro in the 
opposite orientation to the direction of expression of the viral mRNA 
driven from the Moloney Leukaemia Virus (MLV) Long Terminal Repeat 
(LTR). 

25 Following transfection of the vector into the AM 12 amphotropic packaging 
cell line, recombinant retroviral particles were used to infect either B16 
or NIH 3T3 cells. 72-96 hours following infection, expression of the 0- 
galactosidase gene was observed preferentially in B16 cells relative to the 
NIH 3T3 target ceUs by both histochemical and fluorimetric assays. 
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These results demonstrate that the tyrosinase and TRP-1 promoters can 
confer tissue specificity of expression upon an heterologous gene in both 
human and murine melanocyte-derived cell lines when delivered in the 
context of a retroviral vector/ 

5 

Experiments on route 2 have yielded interesting results. Syngeneic 
C57/BL mice were injected s.c. in the flank region with lxlO 5 B16 cells 
and the animals were monitored until a tumour of approximately 0.4 x 0.4 
cm had developed. Similar Colo tumours were established in Balb-C 

10 mice. At this time a single injection of* 1.0 fig of the' tyrosinase 
promoter/pNASS DNA was inoculated in 100 /d volumes directly into the 
centre of the tumour either as * naked* DNA or as calcium phosphate- 
coprecipitated material. Similarly, pNASS-0 and TRP-j3-Gal-2 DNA was 
inoculated. At varying times thereafter, for example at 2, 4, (5 or 10 

IS days, mice were killed, and the tumours were removed and snap-frozen. 
Cryostat sections of these tumours were stained for 0-galactosidase 
activity. Protein expression, manifest by the detection of bright blue cells, 
was clearly apparent in the majority of the injected tumours. The Tyr-/?- 
Gal 2 construct caused the gradual accumulation of positive blue cells in 

20 the injected B16 tumours over the ten day period of examination; whereas 
the same construct injected into the non-melanocytic Colo 26 tumours 
produced no blue staining. Similar results were obtained in three 
independent replicate experiments and from these it was apparent that:- (1) 
the promoterless, control pNASS j8 construct produced no blue cells in 

25 either Colo 26 or B16 tumours; (2) there was a gradual increase in the 
proportion of blue cells in the positive groups over the 10 day period of 
examination (10 days was the last time-point examined because of 
increasing tumour burden) up to an estimated 10-15% of cells (3) no 
qualitative or quantitative difference was obvious between the tyrosinase 

30 or TRP-1 promoter elements or between material injected as naked DNA 
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or as a CaP0 4 -precipitate. Frozen sections of B16 tumours stained 10 
days after DNA injection showed similar results. Interestingly the only 
blue-staining tissue, apart from the neoplastic cells, was confined to the 
base of the hair follicles and thus, presumably, indicated transduction of 
5 normal melanocytes. 

These results show that direct gene transfer may be accomplished by intra- 
tumoural injections. Morphological assessment of the sections indicated 
that the blue cells were restricted to areas occupied by neoplastic tissue, 
10 which is presumed to reflect the tissue- specificity conferred by the 5' 
tyrosinase or TRP-l gene sequence. 

These experiments suggest that direct injections permit good levels of 
expression of introduced genes. The activity produced may be altered by 

15 modification of the introduced DNA (eg incorporation in liposomes, use 
of different precipitating material, variation in route of delivery). Taken 
in combination our results indicate that placing therapeutic genes under 
control of tissue-specific promoter regions may restrict expression to cells 
of a specific lineage. This could be important both for safety/specificity 

20 purposes and would permit the refinement of what otherwise may be a 
fairly non-specific event The utilisation of a cytokine gene has been 
shown to induce modifications in subsequent tumour behaviour. Direct 
delivery of DNA via an intratumoural injection has been shown to produce 
high levels of expression of the introduced gene suggesting that such 

25 promoter-restricted expression may be further limited to the target cells by 
the simple expedient of targeting inoculation. The use of genes encoding 
for proteins capable of eliciting a subsequent systemic response may 
permit this method to be used for disseminated, rather than localised, 
neoplastic disease. 

30 
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Example 5: c-erbB-2 promoter and re porter enzvme 

Reporter enzyme gene. The bacterial chloramphenicol acetyl transferase 
(CAT) gene was obtained from Promega as the a pCAT-basic w vector. 

5 

The CAT reporter system is designed to allow sensitive and rapid testing 
for eukaryotic transcriptional regulatory sequences. This reporter system 
relies on the linkage of genomic DNA fragments containing putative 
regulatory sequences to the chloramphenicol acetyltransferase (CAT> 

10 reporter gene. Transcriptional effects upon the CAT reporter : gene are 
detected after transaction into cultured cells. Since CAT is a bacterial 
gene, levels of CAT enzyme activity in crude cell extracts can be quickly 
and easily assayed with little or no background from endogenous cellular 
gene activity. The pCAT-Basic plasmid lacks eukaryotic promoter and 

IS enhancer sequences. This allows the researcher maximum flexibility in 
cloning any putative regulatory sequences into the convenient multiple 
cloning sites. Expression of CAT activity in cells transfected with this 
plasmid is dependent on insertion of a functional promoter upstream from 
the CAT gene. Enhancer elements can be inserted upstream from the 

20 promoter or at the BamHl site downstream from the CAT gene. 
Sequences to be tested for transcriptional activity can be cloned into the 
following unique sites located immediately upstream from the CAT gene: 
Xbal, Accl, Sail, Psil, Sphl and HindUl. Enhancer elements can be 
cloned separately into the BamHl site downstream from the CAT 

25 transcriptional unit. The vector also contains the gene for ampicillin 
resistance. 

Promoter. The human c-erbB-2 promoter has been cloned to -500 by two 
groups (Ishi et al (1987) Proc Natl Acad Sci USA 84, 4374-4378; Tal et 
30 al (1987) Mol Cell Biol 7, 2597-2601) and to -1500 by a third group 
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(Hudson et al (1990a) J Biol Chem 265, 4389-4393). We have taken 
oligonucleotides to 30b regions around +40 and -500 and, using PCR 
against human genomic DNA, recovered a 540bp fragment representing 
the c-erbB-2 proximal promoter. Using oligos to -1000 and -500 we then 
5 "PCRed" out a further 500bp representing the c-erbB-2 distal promoter. 
The two promoter regions were fused at the Smal site at -500 and the full 
promoter cloned upstream of the CAT gene to generate a reporter plasmid 
for assaying c-erMJ-2 promoter activity in cell lines in vitro. Further 
constructs were made by either deleting 5' regions of the promoter using 
10 convenient restriction enzyme sites, or using PCR technology, to generate 
a series of promoter deletion mutants linked to CAT 3' end always +40; 
5' ends as follows: -1000, -500, -400, -300, -213, -177, -100; (Figure 1). 



15 Construction of c-eriB-2 plasmid* The c-er&B-2 promoter was 
incorporated in the pCAT-basic plasmid to give the plasmid shown in 
Figure 1 by digesting the plasmid with Xbdl and then filling the ends with 
Klenow fragment to create a blunt-ended vector suitable for cloning the 
blunt-ended PCR products. 

20 

The CAT activity from the various promoter constructs was compared to 
baseline activity from the promoterless CAT parent plasmid by calcium 
phosphate mediated DNA transfection into a number of different breast 
cell lines* Immortalised normal and tumour lines which have low 
25 endogenous c-erbB~2 expression showed little activity of the c-erf>R-2 
promoter, ie all the reporter constructs containing c-crfeB-2 sequences 
generated no more CAT activity than the promoterless control plasmid. 
This result makes it unlikely that c-erbB-2 expression is actively repressed 
in these cell lines (by a tumour suppressor-like activity). 

30 
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Fvam ple 6i Promnter region nf the ca rcinnembrvonic antigen gene 

The CEA gene is cloned using standard methods as described by Schrewe 
et al (1990) Mol Cell. Biol. 10, 2738-2748 and sequenced using the 
5 dideoxy chain tennination method of Sanger et al (1980) J. Mol. Biol 
143, 161-178. 

To define the actual portion of the 5' untranslated region which is required 
for the promoter activity of the CEA gene, we carried out functional tests 

10 by placing restriction endonuclease fragments of various lengths from the 
putative promoter regions of both genes upstream of the CAT reporter 
gene and assaying for CAT activity in a transient transaction assay in two 
different human cell lines. For this purpose, we chose the CEA-prcducing 
adenocarcinoma cell line SW403 and, as a negative control, the HeLa cell 

15 line. The CEA promoter constructs showed an enhanced expression of the 
CAT gene in SW403 cells, which was nine times greater than in HeLa 
cells, when the shortest construct was used. It appears that cis regulatory 
sequences, which are responsible for this enhancement, along with a 
functional transcription initiator, are both present within the first 424 

20 nucleotides upstream of the translation^ start. It is also interesting that 
longer CEA constructs are approximately 50% less active in HeLa cells 
than is the shortest construct. A possible explanation for this phenomenon 
is that a silencer region could exist between nucleotides 
-424 and -832 upstream from the translational start, which reduces the 

25 activities in both cell lines through interaction with common mi/w-acting 
regulatory factors. Such silencer sequences have indeed been described 
for other genes. 

Thus, the promoter of the CEA gene is useful for expressing cytokines, 
30 according to the methods of the invention, in colon tumours. 
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As found here for CEA, a number of other eucaryotic genes have also 
been reported which do not contain obvious TATA boxes. The promoters 
of such genes can be divided into two classes. The members of the first 
class are G+C rich and are found primarily in housekeeping genes. 
5 These promoters usually contain several transcription initiation sites spread 
over a fairly large region, as well as potential binding sites for SpL The 
members of the second class are not G+C rich, are not constitutively 
active, but are regulated during differentiation or development and initiate 
transcription at only one or a few tightly clustered start sites. Included in 

10 this class are a number of genes that are regulated during mammalian 
immunodiiferentiation, eg the T-cell receptor 0-chain genes and die 
gene, as well as some Drosophila homeotic genes. The CEA gene shows 
a closer resemblance to this latter group, because its promoter is not 
obviously G+C rich, it contains no identifiable Spl-binding sites, it 

15 reveals only a few tightly clustered start sites, and, most importantly, it 
is not constitutively expressed. 

Figure 6 shows the nucleotide sequence from the promoter region of CEA 
compared with the promoter region of the non-specific cross-reacting 
20 antigen gene (NCA) and the CGM1 gene. The numbers indicate the 
distance in nucleotides from the initiation codon for each gene. Gaps have 
been introduced to allow optimal alignment. Identical nucleotides are 
indicated by dots. The cluster of transcriptional start sites determined for 
CEA and NCA by SI nuclease assays are indicated by arrows. 

25 

Example 7: Promoter reg ion of the prostate-specific antigen gene 

The PSA gene is cloned using standard methods as described by Riegman 
et al (1989) Biochem. Biophys. Res. Comm. 159, 95-102 and Lundwall 
30 (1989) Biochem. Biophys. Res. Comm. 161, 1151-1159 and sequenced 
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using the dideoxy chain termination method of Sanger et al (1980) /. MoL 
Biol 143, 161-178. 

The sequence of the promoter region of PSA gene, compared to that of 
5 the hGK-1 gene, is shown in Figure 7. Dots represent identical 
nucleotides. Putative transcriptional regulatory elements are boxed, 

PSA is expressed at a high level in the prostate; hGK-1, a human 
kallikrein-like gene, is expressed at lower level in the prostate. 

10 

Hie differences in nucleotide sequence between the PSA and hGK-1 
promoters are probably important determinants in prostate-specific gene 
expression. 

15 Thus, the promoter of the PSA gene is useful for expressing cytokines, 
according to the method of the invention, in prostate tumours. 

Example 8: Promoter region of the MUC1 gene 

20 The mucin gene, MUC1 , is selectively expressed in breast and pancreatic 
cell lines but not in non-epithelial cell lines. The promoter region for this 
gene may be obtained by the methods taught in WO 91/09867. 

The 5' sequences flanking the human MUC1 gene are analyzed for their 
25 ability to direct expression of a reporter gene (the chloramphenical 
transferase gene, CAT) in cell lines which normally express or do not 
express the MUC1 gene. A construct containing 2.9 kb of MUC1 5' 
flanking sequence shows expression of CAT in breast and pancreatic cell 
lines but not in the non-epithelial cell lines HT 1080, SK23 and HTB96. 
30 Deletion analysis shows that maximum expression was obtained in ZR-75 



WO 94/04196 



PCT/GB93/01730 



53 

(breast cancer line) and HPAP (pancreatic cancer line) with only 743 bp 
of 5' flanking sequence. Sequences within 1.6 kb of the transcriptional 
start site showed enhancing activity in a vector carrying an enhancerless 
SV40 promoter. Analysis of proximal 5' sequences in a promoterless 
5 CAT vector carrying the SV40 enhancer shows that sequences between - 
60 and -150 were crucial for tissue specific expression. An Spl site at - 
99/-90 and an E-box (E-MUC1) at -84/-64 in this region are shown by 
mutational analysis to play a role in the regulation of transcription. Gel 
shift analysis with oligonucleotides and nuclear extracts of ZR-75 showed 
10 protein binding to both of these sites. Spl binding activity is similar in 
ZR-75 and HT1080 cells whereas binding of factors to the E-MUC1 
oligonucleotide reveals quantitative and qualitative differences between 
epithelial and non-epithelial cells. 

15 Thus, the promoter of the MUC1 gene is useful for expressing cytokines, 
according to the method of the invention, in pancreatic and breast 
tumours. 

Example 9: Trea tment of patients 

20 

1. Patient selection 

a) Patients with metastatic malignant melanoma with good 
performance data (WHO Grade zero 1 or 2) with a life expectancy of at 
least three months, normal renal and liver function and haematology, 

25 normal bilirubin and no evidence of cerebral secondaries are selected. 

b) Written consent is obtained. 

c) Patients need not have received prior chemotherapy because of the 
low activity, toxicity and immunosuppression of such treatments. They 
can be administered after the gene therapy is completed, if indicated. 

30 d) Diagnosis of metastasis is confirmed by fine needle aspiration 
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cytology. 

2. Administration of constructs 

a) The constructs used are composed of a 769 bp fragment or a 2.5 
5 kb fragment of the 5' flanking sequence of the murine tyrosinase gene 

driving the human DL-2 gene within the promoterless mammalian 
expression vector pNASSjS (Clontech, Ca, USA). The decision to use the 
murine promoter sequence is based upon our demonstration that this 
sequence works well in human cells. Initial purification of the bulk grown 

10 plasmids DNA is achieved using QIAGEN-tips for plasmid purification 
(this is an anion exchange resin). The bacterial cells used as recipients for 
the plasmid constructs are the E. coli strain JM109. Verification of 
plasmid purity is by agarose gel electrophoresis. It is prepared to the 
same pyrogen free standards as monoclonal antibodies which are given in 

15 much higher amounts. It is administered in sterile saline. 

b) All injections are given by a qualified medical practitioner with 
MRCP or equivalent and training in medical oncology. A 27 gauge 
needle is used and local anaesthetic administered first. 

c) Patients are admitted for 24 hours following the injection and will 
20 be seen at three days and one week and thereafter weekly for one month 

and then monthly. The injection site is carefully examined and analgesia 
given as necessary. 

3. Studies on initial needle aspirate for diagnostic purposes 
25 a) immunocytochemistry for melanoma cells and assessment of cell 
cycle distribution. 

b) PCR to assess cytokine expression - EL-2, interferon-? and TNFot. 
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4. Dosage schedule 
tyrosinase/IL-2 

Dose Biopsy 
Cohort 1 100 fig DNA/200 fd 1 week 
5 2 100 tig DNA/200 jd 2 weeks 

5. Studies of excisional biopsy after construct injection 

a) immunochemistry for melanoma cells. 

b) genomic PCR to assess the construct 

10 c) staining for lymphocyte sub-populations and dendritic cells, PCR 
for IL-2 interferon-? and TNFor. In situ hybridisation for the same 
cytokines. 

d) assessment of cytotoxic T cell response to autologous melanoma 
cells. Cells obtained from the biopsies will be used in chromium release 
15 assays, as well as peripheral T cells. 

6. Studies on stored DNA preparations 

a) In order to verify that the prepared DNA has not been degraded, 
routine examination of an aliquot of the injected material by agarose gel 
20 electrophoresis should be carried out. 

Assessment of results 

Hie effectiveness of this approach is assessed by three criteria. 

25 D Assessment of IL-2 expression bv RTPCR in situ hybridisation and 
immunochemistry 



A similar level of expression within 10-15% of tumour cells is found. 
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2S Assessment of loca l immune response bv immunocvtochemistgg 



Lymphocyte subpopulations and dendritic cells are stained to assess 
subtypes of cells present after the injections. 

5 

3^ Assessment of cytotoxic T ceU responses 

There is 1-2 weeks of local TLrl production. 

10 There is a demonstration of a positive T cell response. 

Genes that can be expressed include cytokines such as TNFa, GM-CSF, 
TLrA, interferon-7 or the proteins involved in T cell antigen recognition 
like class 1 molecules, or B7. 

15 

Safety 

Considering the life expectancy of these patients who already have 
metastatic cancer, the risks of insertion of genetic material into the somatic 

20 cells of the body would appear to be minimal. Clearly there may be 
events resulting from positional integration into the genome, eg insertional 
mutagenesis, inactivation or enhancement of expression, which could 
theoretically be deleterious. However, these have not manifested 
themselves in over 200 injections into recipient mice and their importance 

25 appears to be more theoretical than practical. Moreover, should adverse 
immunological reactions occur, they are unlikely to be beyond control 
with a range of immunosuppressive agents. Again, the short life 
expectancy of these patients makes long term undesirable sequelae an 
unlikely event. The risks of chemotherapy with marrow suppression, 

30 allergic reactions, Budd-Chiari syndrome and infection would all seem to 
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pose much greater clinical problems than the local injection of DNA. 

Example 10: Co-injection of IL-2 expressing and B7-expressing DNA 
constructs into a melanoma 

5 

A TRP-1-B7 construct is made using PCR, the sequence information in 
the sequence listing and a DNA vector such that expression of the B7 
coding sequence is driven by the TRP-! promoter. 

10 The TRP-1-B7 construct and the TRP-l-IL-2 construct of Example 3 are 
prepared in sterile, pyrogen free water. 100 fig of each DNA construct 
in 200 fd of water is injected into the melanoma at weekly intervals until 
the tumour regresses. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATIONS 

(1) APPLICANT J 

(A) NAME* Imperial Cancer Research Technology Ltd 

(B) STREETS Sardinia House, Sardinia Street 

(C) CITYs London 

(£) COUNTRY* United Kingdom 

(F) POSTAL CODE (ZIP) I WC2A 3NL 

(G) TELEPHONES 071 242 1136 

(H) TELEFAX s 071 831 4991 

(I) TELEX s 265107 ICRF O 

(ii) TITLE OF INVENTIONS Tumour therapy 
(iii) NUMBER OF SEQUENCES s 22 

(iv) COMPUTER READABLE FORMS 

(A) MEDIUM TYPEs Floppy disk 

(B) COMPUTERS IBM PC compatible 

(C) OPERATING SYSTEMS PC— DOS /MS—DOS 

(D) SOFTWARES Patentln Release #1.0, Version #1*25 (EPO) 



(2) INFORMATION FOR SEQ ID NO J 1: 

(i) SEQUENCE CHARACTERISTICS S 

(A) LENGTHS 3281 base pairs 

(B) TYPEs nucleic acid 

(C) STRANDEDNESSs double 

(D) TOPOLOGY $ linear 

(ii) MOLECULE TYPEs DNA (genomic) 
(iii) HYPOTHETICAL s NO 
(iii) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: Is 



GAGCTCCTCA 


CACGGACTCT 


GTCAGCTCCT 


CCCTGCAGCC 


TATCGGCCGC 


CCACCTGAGG 


60 


CTTGTCGGCC 


GCCCACTTGA 


GGCCTGTCGG 


CTGCCCTCTG 


CAGGCAGCTC 


CTGTCCCCTA 


120 


CACCCCCTCC 


TTCCCCGGGC 


TCAGCTGAAA 


GGGCGTCTCC 


CAGGGCAGCT 


CCCTGTGATC 


180 


TCCAGGACAG 


CTCAGTCTCT 


CACAGGCTCC 


GACGCCCCCT 


ATGCTGTCAC 


CTCACAGCCC 


240 


TGTCATTACC 


ATTAACTCCT 


CAGTCCCATG 


AAGTTCACTG 


AGCGCCTGTC 


TCCCGGTTAC 


300 


AGGAAAACTC 


TGTGACAGGG 


ACCACGTCTG 


TCCTGCTCTC 


TGTGGAATCC 


CAGGGCCCAG 


380 


CCAGTGCCTG 


ACACGGAACA 


GATGCTCCAT 


AAATACTGGT 


TAAATGTGTG 


GGAGATCTCT 


420 


AAAAAGAAAC 


ATATCACCTC 


CGTGTGGCCC 


CCAGCAGTCA 


GAGTCTGTTC 


CATGTGGACA 


480 


CAGGGGCACT 


GGCACCAGCA 


TGGGAGGAGG 


CCAGCAAGTG 


CCCGCGGCTG 


CCCCAGGAAT 


540 


GAGGCCTCAA 


CCCCCAGAGC 


TTCAGAAGGG 


AGGACAGAGG 


CCTGCAGGGA 


ATAGATCCTC 


600 


CGGCCTGACC 


CTGCAGCCTA 


ATCCTGAGTT 


CAGGGTCAGC 


TCACACCACG 


TCGACCCTGG 


660 
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TCAGCATCCC 


TAGGGCAGTT 


CCAGACAAGG 


CCGGAGGTCT 


CCTCTTGCCC 


TCCAGGGGGT 


720 


GAGATTGGAC 


ACAGACATCA 


CTCAGGAAAC 


GGATTCCCCT 


GGACAGGAAC 


CTGGCTTTGC 


780 


TAAGGAAGTG 


GAGGTGGAGC 


CTGGTTTCCA 


TCCCTTGCXC 


CAACAGACCC 


TTCTGATCTC 


840 


TCCGAGATAC 


CTGCTCTGTT 


CCTTTCTGGG 


TCCTCTGAG6 


ACCTGTTCTG 


CCAGGGGTCC 


900 


CTGTGCAACT 


CCAGACTCCC 


TCCTGGTACC 


ACCATGGGGA 


AGGTGGGGTG 


ATCACAGGAC 


960 


AGTCAGCCTC 


GCAGAGAGAG 


AGACCACCCA 


GGACTGTCA6 


GGAGAACATG 


GACAGGCCCT 


1020 


GAGCCGGAGC 


TCAGCCAACA 


GAGAOGGAGA 


GGGAGGGTCC 


CCCTGGAGCC 


TTCCCCAAGG 


1080 


AGAGGAGAGC 


CCAGAGTGAC 


CCACCTCCCT 


CCACCACAGT 


CCTCTCTTTC 


CAGGACACAC 


1140 


AAGACACCTC 


CCCCTCGACA 


TGCAGGAXCT 


GGGGACTCCX 


GAGACCTCTG 


GGCCTGGGTC 


1200 


TCCATCCCTG 


GGTCAGTGGC 


GGGGTTGGTG 


GTACTGGAGA 


CAGAGGGCTG 


GTCCCTCCCC 


1260 


AGCGACGACC 


CAGTGAGCCT 


TTTTCTAGCC 


CCCAGAGOCA 


CCTCTGTCAC 


CTTCCTGTTO 


1320 


GGGATGATCC 


CACCTTCCCA 


GAGCCCTGGA 


GAGCATGGGG 


AGACCCGGGA 


CCTGCTGGGT 


1380 


TTCTCT6TCA 


CAAAGGAAAA 


TAATCCCCCT 


GGTGTGACAG 


ACCCAAGGAC 


AGAACACAGC 


1440 


AGAGGTCAGC 


ACTGGQGAAA 


GACAGGTTGT 


CCACAGGGGA 


TGGGGGTCCA 


TCCACCTTGC 


1500 


CGAAAAGATT 


TGTCTGAGGA 


ACTGAAAATA 


GAAGGGAAAA 


AAGAGGAGGG 


ACAAAAGAGO 


1560 


CAGAAATGAG 


AGGGGAGGGG 


ACAGAGGACA 


CCTGAATAAA 


GACCACACCC 


ATGACCCACG 


1620 


TGATGCTGAG 


AAGTACTCCT 


GCCCTAGGAA 


GAGACTCAGG 


GCAGAGGGAG 


GAAGGACAGC 


1680 


AGACCAGACA 


GTGAGAGGAG 


CCTTGACAAA 


ACGTTCCTGG 


AACTCAAGCT 


CTTCTCCACA 


1740 


GAGGAGGACA 


GAGCAGACA6 


CAGAGACCAT 


GGAGTCTCCC 


TCGGCCCCTC 


CCCACAGATG 


1800 


GTGCATCCCC 


TGGGAGAGGC 


TCCTGCTCAC 


AGGTGAAGG6 


AGGACAACCC 


CXGGGAGAGG 


1860 


GTGGGAGGAG 


GGAGCACAGA 


GACTGGCTGG 


GGTCTCCTG6 


GTAGGACAGG 


GCTGTGA6AC 


1920 


GGACAGAGGG 


CTCCTGTTGG 


AGCCTGAATA 


GGGAAGAGGA 


CATCAGAGAG 


GGACAGGAGT 


1980 


GACACGAGAA 


AAATCAAATT 


GAACTGGAAT 


TGGAAAGGGG 


CAGGAAAACC 


TCAAGAGTTC 


2040 


TATTTTCCTA 


GTTAATTGTC 


ACTGGCCACT 


ACGTTTTTAA 


AAATCATAAT 


AACTGCATCA 


2100 


GATGACACTT 


TAAATAAAAA 


CATAACCAGG 


GCATGAAACA 


CTGTCCTCAT 


CCGCCXACCG 


2160 


CGGAGATTGG 


AAAATAAGCC 


CCAGGCTGTG 


GAGGGCCCTG 


GGAACCCTCA 


TGAACTCATC 


2220 


GACAGGAATC 


TGCAGCCTGT 


CCCAGGCACT 


GGGTGCAACC 


AAGATCACAC 


AAATCCCTGC 


2280 


CCTCATGAAG 


CTCATGCTCX 


CATGGGGAGG 


AAGACAGACA 


TACAAAGAGA 


TCTAGAATGT 


2340 


GAGGXGAGGT 


GTTGACAAGA 


GCCTGGAGGG 


AATAGAGCAG 


GGAAAGGXCA 


GAAAAGGAAG 


2400 


ACCCAAGGTC 


TCTAGAGGAG 


GTGTCAGGGA 


AGGGATCXCC 


CAAGAATGCC 


CTGATGTGAG 


2460 


CAGGACCTGA 


AGGCAATGGG 


GAGGGAGCCG 


TGAAGACCCC 


TGGAAAAGCA 


GATTCCACAC 


2520 


AGGGAAATGC 


CAAGGTCGGA 


GGTGCTAAGG 


AAATAGGAGA 


CACACTGCTG 


ACCTTGACCT 


2560 


AGTAGGACAC 


ACACACACAC 


ACACACACAC 


ACTCACTCAC 


TCCAGGGCTG 


GGGGATGAAG 


2640 
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A6ACCTGCTC AGGACCCAGG ACCCCATTTT TCCACCCTAA TGCATAGGTC CCAATATTGA 2700 

CCGATGCTCT CTGCTCTCTC CTAGCCTCAC TTCTAACCTT CTGGAACCCG CCCACCACTG 2760 

CCAAGCTCAC TATTGAATCC ACGCCGTTCA ATGTCGCAGA GGGGAAGGAG GTGCTTCTAC 2820 

TTGTCCAGAA TCTGCCCCAG CATCTTTTTG GCTACAGCTG GTACAAAGGT GAAAGAGTGG 2880 

ATGGGAACOG TCAAATTATA GGATATGTAA TAGGAACTCA ACAAGCTACC CCAGGGCCOG 2940 

CATACAGTGG TCGAGAGATA ATATACCCCA ATGCATCCCT GCTGATCCAG AACATCATCC 3000 

AGAATGACAC AGGATTCTAC ACCCTACACG TCATAAAGTC AG ATCTTGTG AATGAAGAA0 3060 

CAACTGGCCA GTTCCGGGTA TACCGTGAGT GATTCCCCCA TGACCTCTGG GTGTTGGGGG 3120 

TCAGTTCTAC TTCCCACAGA CAGGATTATC AGGCCTGGGC TGTGCTGTGG CCCCCTCTGC 3180 

ATTACGAACC ATGTTAGGGT TTGGGCATTT AGTGCAGGAT ACACACAGAA G AG ACAAACT 3240 

TCAACAGATC AGAATTCCTT TCCGGCATCC AGACCCTGCA G 3281 
(2) INFORMATION FOR SEQ ID NO J 2t 

(1) SEQUENCE CHARACTERISTICS 8 

(A) LENGTHS 7130 base pairs 

(B) TYPE* nucleic acid 

(C) STRANDEDNESSs double 

(D) TOPOLOGY i linear 

(ii) MOLECULE TYPES DNA (genomic) 
(iii) HYPOTHETICAL* NO 
(iii) ANTI-SENSE I NO 



(xi) SEQUENCE DESCRIPTION t SI 


JQ ID NOt 2s 






GAATTCCACA 


TTGTTTGCTG 


CACGTTGGAT 


TTTGAAATGC 


TAGGGAACTT 


TGGGAGACTC 


60 


ATATTTCTGG 


GCTAGAGGAT 


CT6TGGACCA 


CAAGATCTTT 


TTATGATGAC 


AGTAGCAATG 


120 


TATCTGTGGA 


GCTGGATTCT 


GGGTTGGGAG 


TGCAAGGAAA AGAATGTACT 


AAATGCCAAG 


180 


ACATCTATTT 


CAGGAGCATG 


AGGAATAAAA 


GTT CTAGTTT 


CTGGTCTCAG 


AGTGGTGCAG 


240 


GGATCAGGGA 


GTCTCACAAT 


CTCCTGAGTG 


CTGGTGTCTT 


AGGGCACACT 


GGGTCTTGGA 


300 


GTGCAAAGGA 


TCTAGGCACG 


TGAGGCTTTG 


TATGAAGAAT 


CGGGGATCGT 


ACCCACCCCC 


360 


TGTTTCTGTT 


TCATCCTGGG 


CATGTCTCCT 


CTGCCTTTGT 


CCCCTAGATG 


AA6TCTCCAT 


420 


GAGCTACAAG 


GGCCTGGTGC 


ATCCAGGGTG 


ATCTAGTAAT 


TGGAGAACAG 


CAAGTGCTAG 


480 


CTCTCCCTCC 


CCTTCCACAG 


CTCTGGGTGT 


GGGAGGGGGT 


TGTCCAGCCT 


CCAGCAGCAT 


540 


GGGGAGGGCC 


TTGGTCAGCC 


TCTGGGTGCC 


AGCAGGGCAG 


GGGCGGAGTC 


CTGGGGAATG 


600 


AAGGTTTTAT 


AGGGCTCCTG 


GGGGAGGCTC 


CCCAGCCCCA 


AGCTTACCAC 


CTGCACCCGG 


660 


AGAGCTGTGT 


CACCATGTGG 


GTCCCGGTTG 


TCTTCCTCAC 




ACGTGGATTG 


720 


GTGAGAGGGG 


CCATGGTTGG 


GGGGATGCAG 


GAGAGGGAGC 


CAGCCCTGAC 


TGTCAAGCTG 


780 
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AGGCTCTTTC CCCCCCAACC CAGCACCCCA GCCCAGACAG GGAGCTGGGC TCTTTTCTGT 840 

CTCTCCCAGC CCCACTTCAA GCCCATACCC CCAGCCCCTC CATATTGCAA CAGTCCTCAC 900 

TCCCACACCA GGTCCCCGCT CCCTCCCACT TACCCCAGAA CTTTCTCCCC ATTGCCCAGC 960 

CAGCTCCCTG CTCCCAGCTG CTTTACTAAA GGGGAAGTTC CTGGGCATCT CCGTGTTTCT 1020 

CTTTGTGGG6 CTCAAAACCT CCAAGGACCT CTCTCAATGC CATTGGTTCC TTGGACCGTA 1080 

TCACTGGTCC ATCTCCTGAG CCCCTCAATC CTATCACAGT CTACTGACTT TTCCCATTCA 1140 

GCTGTGAGTG TCCAACCCTA TCCCAGAGAC CTTGATGCTT GGCCTCCCAA TCTTGCCCTA 1200 

GG ATACCCAG ATGCCAACCA GACACCTCCT TCTTCCTAGC CAGGCTATCT GGCCTGAGAC 1260 

AACAAATGGG TCCCTCAGTC TGGCAATGGG ACTCTGAGAA CTCCTCATTC CCTGACTCTT 1320 

AGCCCCAGAC TCTTCATTCA GTGGCCCACA TTTTCCTTAG GAAAAACATG AGCATCCCCA 1380 

GCCACAACTG CCAGCTCTCT GATTCCCCAA ATCTGCATCC TTTTCAAAAC CTAAAAACAA 1440 

AAAGAAAAAC AAATAAAACA AAACCAACTC AGACCAGAAC TGTTTTCTCA ACCTGGGACT 1500 

TCCTAAACTT TGCAAAACCT TCCTCTTCCA GCAACTGAAC CTGGCCATAA GGCACTTATC 1560 

CCTGGTTCCT AGCACCCCTT ATCCCCTCAG AATCCACAAC TTGTACCAAG TTTCCCTTCT 1620 

CCCAGTCCAA GACCCCAAAT CACGACAAAG GACCCAATCC CCAGACTCAA GATATGGTCT 1680 

GGGCGCTGTC TTGTGTCTCC TACCCTGATC CCTGGGTTCA ACTCTGCTCC CAGAGCATGA 1740 

AGCCTCTCCA CCAGCACCAG CCACCAACCT GCAAACCTAG GGAAGATTGA CAGAATTCCC 1800 

AGCCTTTCCC AGCTCCCCCT GCCCATGTCC CAGGACTCCC AGCCTTGGTT CTCTGCCCCC 1860 

GTGTCTTTTC AAACCCACAT CCTAAATCCA TCTCCTATCC GAGTCCCCCA GTTCCCCCTG 1920 

TCAACCCTGA TTCCCCTGAT CTAGCACCCC CTCTGCAGGC GCTGOGCCCC TCATCCTGTC 1980 

TCGGATTGTG GGAGGCTGGG AGTGCGAGAA GCATTCCCAA CCCTGGCAGG TGCTTGTGGC 2040 

CTCTCGTGGC AGGGCAGTCT GCGGCGGTGT TCTGGTGCAC CCCGAGTGGG TCCTCACAGC 2100 

TGCCCACTGC ATCAGGAAGT GAGTAGGGGC CTGGGGTCTG GGGAGCAGGT GTCTGTGTCC 2160 

GAGAGGAATA ACAGCTGGGC ATTTTCCCCA GGATAACCTC TAAGGCGAGC CTTGGGACTG 2220 

GGGGAGAGAG GGAAAGTTCT GGTTCAGGTC ACATGGGGAG GCAGGGTTGG GGCTGGACGA 2280 

CCCTCCCCAT GGCTGCCTGG GTCTCCATCT GTGTCCCTCT ATGTCTCTTT GTGTCGCTTT 2340 

CATTATGTCT CTTGGTAACT GGCTTCGGTT GTGTCTCTCC GTGTGACTAT TTTGTTCTCT 2400 

CTCTCCCTCT CTTCTCTGTC TTCAGTCTCC ATATCTCCCC CTCTCTCTGT CCTTCTCTGG 2460 

TCCCTCTCTA GCCAGTGTGT CTCACCCTGT ATCTCTCTGC CAGGCTCTGT CTCTCGGTCT 2520 

CTGTCTCACC TGTGCCTTCT CCCTACTGAA GACACGCACG GGATGGGCCT GGGGGGACCC 2580 

TGAGAAAAGG AAGGGCTTTG GCTGGGCGCG GTGGCTCACA CCTGTAATCC CAGCACTTTG 2640 

GGAGGCGAAG GCAGGTAGAT CACCTGAGGT CAGGAGTTCG AG ACCAGCCT GGCCAACTGG 2700 

TGAAACCCCA TCTCTACTAA AAATACAAAA AATTAGCCAG GCGTGGTGGC GCATGCCTGT 2760 
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AGTCCCAGCT 


ACTCAGGAGG 


CTGAGGGAGG 


AGAATTGCTT 


GAACCTGGGA 


GGTTGAGGTT 


2820 


GCAGTGAGCC 


GAGACCGTGC 


CACTGCACTC 


CAGCCTGGGT 


GACAGAGTGA 


GACTCCGCCT 


2880 


CAAAAAAAAA AAAAAAAAAA 


AAAAAAAAAA 


AGAAAAGAAA 


AGAAAAGAAA 


AGGAATCTTT 


2940 


TATCCCTGAT GTGT6TG66T 


ATGAGGGTAT GAGAGGGCCC 


CTCTCACTCC 


ATTCCTTCTC 


3000 


CAGGACATCC 


CTCCACTCTT 


GGGAGACACA 


GAGAAGGGCT 


GGTTCGAGCT 


GGAGCTGGGA 


3060 


GGGGCAATTG 


AGGGAGGAGG 


AAGGAGAAGG 


GGGAAGGAAA 


ACAGGGTATG 


GGGGAAAGGA 


3120 


CCCTGGGGAG 


CGAAGTGGAG 


GATAGAACCX 


TGGGCCTGGA 


GGCCAGGCTA 


CCTACCCACT 


3180 


TGGAAACCCA 


CGCCAAAGCC 


GCATCTACAG 


CTGAGCCACT 


CTGAGGCCTC 


CCCTCCCCGG 


3240 


CGGTCCCCAC 


TCAGCTCCAA 


AGTCTCTCTC 


CCTTTTCTCT 


CGGACACTTT 


ATCATCCOCC 


3300 


GGATTCCTCT 


CTACTTGGTT 


CTCATTCTTC 


CTTTGACTTC 


CTGCTTCCCT 


TTCTCATTCA 


3360 


TCTGTTTCTC 


ACTTTCTGCC 


TGGTTTTGTT 


CTTCTCTCTC 


TCTTTCTCTG 


GCCCATGTCT 


3420 


GTTTCTCTAT 


GTTTCTGTCT 


TTTCTTTCTC 


ATCCTGTGTA 


TTTTCGGCTC 


ACCTTGTTTG 


3480 


TCACTGTTCT 


CCCCTCTGCC 


CTTTCATTCT 


CTCTGTCCTT 


TTACCCTCTT 


CCTTTTTCCC 


3540 


TTGGTTTCTC 


TCAGTTTCTG 


TATCTGCCCT 


TCACCCTCTC 


ACACTGCTGT 


TTCCCAACTC 


3600 


GTTGTCTGTA 


TTTTTGGCCT 


GAACTGTGTC 


TTCCCCAACC 


CTGTGTTTTT 


CTCACTGTTT 


3660 


CTTTTTCTCT 


TTTGGAGCCT 


CCTCCTTGCT 


CCTCTGTCCC 


TTCTCTCTTT 


CCTTATCATC 


3720 


CTCGCTCCTC 


ATTCCTGCGT 


CTGCTTCCTC 


CCGAGCAAAA 


GCGTGATCTT 


GCTGGGTOGG 


3780 


CACAGCCTGT 


TTCATCCTGA 


AGACACAGGC 


CAGGTATTTC 


AGGTCAGCCA 


CAGCTTCCCA 


3840 


GACCCGCTCT 


ACGATATGAG 


CCTCCTGAAG 


AATCGATTCC 


TCAGGCCAGG 


TGATGACTCC 


3900 


AGCCACGACC 


TCATGCTGCT 


CCGCCTGTCA 


GAGCCTGCOG 


AGCTCACGGA 


TGC7TGTGAAG 


3960 


GTCATGQACC 


TGCCCACCCA 


GGAGCCAGCA 


CTGGGGACCA 


CCTGCTACGC 


CTCAGGCTGG 


4020 


GGCAGCATTG 


AACCAGAGGA 


GTGTACGCCT 


GGGCCAGATG 


GTGCAGCCGG 


GAGCCCAGAT 


4080 


GCCTGGGTCT 


GAGGGAGGAG 


GGGAGAGGAC 


TCCTGGGTCT 


GAGGGAGGAG 


GGCCAAGGAA 


4140 


CCAGGTGGGG 


TCCAGCCCAC 


AACAGTGTTT 


TTGCCTGGCC 


CGTAGTCTTG 


ACCCCAAAGA 


4200 


AACTTCAGTG 


TGTGGACCTC 


CATGTTATTT 


CCAATGACGT 


GTGTGCGCAA 


GTTCACCCTC 


4260 


AGAAGGTGAC 


CAAGTTCATG 


CTGTGTGCTG 


GACGCTGGAC 


AGGGGGCAAA 


AGCACCTGCT 


4320 


CGGTGAGTGA 


TCCCTACTCC 


CAAGATCTTG 


AGGGGAAAGG 


TGAGTGGGGA 


CCTTAATTCT 


4380 


GGGCTGGGGT 


CTAGAAGCCA 


ACAAGGCGTC 


TGCCTCCCCT 


GCTCCCCAGC 


TGTAGCCATG 


4440 


CCACCTCCCC 


GTGTCTCATC 


TCATTCCCTC 


CTTCCCTCTT 


CTTTGACTCC 


CTCAAGGCAA 


4500 


TAGGTTATTC 


TT AC AG CACA 


ACTCATCTGT 


TCCTGCGTTC 


AGCACACGGT 


TACTAGGCAC 


4560 


CTGCTATGCA 


CCCAGCACTG 


CCCTAGAGCC 


TGGGACATAG 


CAGTGAACAG 


AGAGAGAGCA . 


4620 


GCCCCTCCCT 


TCTGTAGCCC 


CCAAGCCAGT 


GAGGGGCACA 


GGCAGGAACA 


GGGACCACAA 


4680 


CAGAGAAAAG 


CTGGAGGGTG 


TCAGGAGGTG 


ATCAGGCTCT 


CGGGGAGGGA 


GAAGGGGTGG 


4740 
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GGAGTGTGAC TGGGAGGAGA CATCCTGCAG AAGGTGGGAG TGAGCAAACA CCTGCCGCAG 
GGGAGCGGAG GGCCCTGCGG CACCTGGGGG AGCAGAGGGA ACAGCATCTG GCCAGGCCXG 
GGAGGAGGGG CCTAGAGGGC GTCAGGAGCA GAGAGGAGGT TGCCTGGCTG GAGTGAAGGA 
TCGGGGCAGG GTGCGAGAGG GAAGAAAGGA CCCCTCCTGC AGGGCCTCAC CTGGGCCACA 
GGAGGACACT GCTTTTCCTC TGAGGAGTCA GGAACTGTGG ATGGTGCTGG ACAGAAGCAG 
GAGAGGGCCT GGCTCAGGTG TCCAGAGGCT GCCGCTGGCC TCCCTATGGG ATCAGACTGC 
AGGGAGGGAG GGCAGCAGGG ATGTGGAGGG AGTGATGATG GGGCTGACCT GGGGGTGGCT 
CCAGGCATTG TCCCCACCTG GGCCCTTACC CAGCCTCCCT CACAGGCTCC TGGCCCTCAG 
TCTCTCCCCT CCACTCCATT CTCCACCTAC CCACAGTGGG TCATTCTGAT CACCGAACTG 
ACCATGCCAG CCCTGCCGAT GGTCCTCCAT GGCTCCCTAG TGCCCTGGAG AGGAGGTGTC 
TAGTCAGAGA GTAGTCCTGG AAGGTGGCCT CTGTGAGGAG CCACGGGGAC AGCATCCTGC 
AGATGGTCCT GGCCCTTGTC CCACCGACCT GTCTACAAGG ACTGTCCTCG TGGACCCTCC 
CCTCTGCACA GGAGCTGGAC CCTGAAGTCC CTTCCCTACC GGOCAGGACT GGAGCCCCTA 
CCCCTCTGTT GGAATCCCTG CCCACCTTCT TCTGGAAGTC GGCTCTGGAG ACATTTCTCT 
CTTCTTCCAA AGCTGGGAAC TGCTATCTGT TATCTGCCTG TCCAGGTCTG AAAGATAGGA 
TTGCCCAGGC AGAAACTGGG ACTGACCTAT CTCACTCTCT CCCTGCTTTT ACCCTTAGGG 
TGATTCTGGG GGCCCACTTG TCTGTAATGG TGTGCTTCAA GGTATCACGT CATGGGGCAG 
TGAACCATGT GCCCTGCCOG AAAGGCCTTC CCTGTACACC AAGGTGGTGC ATTACOGGAA 
GTGGATCAAG GACACCATCG TGGCCAACCC CTGAGCACCC CTATCAACTC CCTATTGTAG 
TAAACTTGGA ACCTTGGAAA TGACCAGGCC AAGACTCAAG CCTCCCCAGT TCTACTGACC 
TTTGTCCTTA GGTGTGAGGT CCAGGGTTGC TAGGAAAAGA AATCAGCAGA CACAGGTGTA 
GACCAGAGTG TTTCTTAAAT GGTGTAATTT TGTCCTCTCT GTGTCCTGGG GAATACTGGC 
CATGCCTGGA GACATATCAC TCAATTTCTC TGAGGACACA GATAGGATGG GGTGTCTGTC 
TTATTTGTGG GATACAGAGA TGAAAGAGGG GTGGGATCCA CACTGAGAGA GTGGAGAGTO 
ACATGTGCTG GACACTGTCC ATGAAGCACT GAGCAGAAGC TGGAGGCACA ACGCACCAGA 
CACTCACAGC AAGGATGGAG CTGAAAACAT AACCCACTCT GTCCTGGAGG CACTGGGAAG 
CCTAGAGAAG GCTGTGAGCC AAGGAGGGAG GGTCTTCCTT TGCCATGGGA TGGGGATGAA 
GTAAGGAGAG GGACTGGACC CCCTGGAAGC TGATTCACTA TGGGGGGAGG TGTATTGAAG 
TCCTCCAGAC AACCCTCAGA TTTGATGATT TCCTAGTAGA ACTCACAGAA ATAAAGAGCT 
CTTATACTGT GGTTTATTCT GGTTTGTTAC ATTGACAGGA GACACACTGA AATCAGCAAA 
GGAAACAGGC ATCTAAGTGG GGATGTGAAG AAAACAGGGA AAATCTTTCA GTTGTTTTCT 
CCCAGTGGGG TGTTGTGGAC AGCACTTAAA TCACACAGAA GTGATGTGTG ACCTTGTGTA 
TGAAGTATTT CCAACTAAGG AAGCTCACCT GAGCCTTAGT GTCCAGAGTT CTTATTGGGG 



4800 
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5580 
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GTCTGTAGGA TAGGCATGGG GTACTGGAAT AGCTGACCTT AACTTCTCAG ACCTG AGGTT 6780 

CCCAAGAGTT CAAGCAGATA CAGCATGGCC TAGAGCCTCA GATGTACAAA AACAGGCATT 6840 

CATCATGAAT CGCACTGTTA GCATGAATCA TCTGGCACGG CCCAAGGCCC CAGGTATACC 6900 

AAGGCACTTG GGCCGAATGT TCCAAGGGAT. TAAATGTCAT CTCCCAGGAG TTATTCAAGG 6960 

GTGAGCCCTG TACTTGGAAC GTTCAGGCTT TGAGCAGTGC AGGGCTGCTG AGTCAACCTT 7020 

TTACTGTACA GGGGGGTGAG GGAAAGGGAG AAGATGAGGA AACCGCCTAG GGATCTGGTT 7080 

CTGTCTTGTG GCCGAGTGGA CCATGGGGCT ATCCCAAGAA GGAGGAATTC . 7130 

(2) INFORMATION FOR SEQ ID NOs 3 s 

(i) SEQUENCE CHARACTERISTICS s 

(A) LENGTHS 858 base pairs 
<B) TYPEs nucleic acid 

(C) STRANDEDNESSs double 

(D) TOPOLOGY* linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(lii) HYPOTHETICAL* NO 

(lii) ANTI-SENSE s NO 

<vi> ORIGINAL SOURCE* 

(A) ORGANISM* Homo sapiens 

(xi) SEQUENCE DESCRIPTIONS SEQ ID NOs 3s 

CGAGCGGCCC CTCAGCTTGC GCGGCCCAGC CCCGCAAGGC TCCCGGTGAC CACTAGAGGG 60 

CGGGAGGAGC TCCTGGCCAG TGGTGGAGAG TGGCAAGGAA GGACCCTAGG GTTCATCGGA 120 

GCCCAGGTTT ACTCCCTTAA GTGGAAATTT CTTCCCCCAC TCCTCCTTGG CTTTCTCCAA 180 

GGAGGGAACC CAGGCTGCTG GAAAGTCCGG CTGGGGGGGG GACTGTGGGT TCAGGGGAGA 240 

ACGGGGTGTG G AACGGG ACA GGGAGCGGTT AGAAGGGTGG GGCTATTCCG GGAAGTGGTG 300 

GGGGGAGGGA GCCCAAAACT AGCACCTAGT CCACTCATTA TCCAGCCCTC TTATTTCTCG 360 

GCCGCTCTGC TTCAGTGGAC CCGGGGAGGG CGGGGAAGTG GAGTGGGAGA CCTAGGGGTG 420 

GGCTTCCCGA CCTTGCTGTA CAGGACCTCG ACCTAGCTGG CTTTGTTCCC CATCCCCACG 480 

TTAGTTGTTG CCCTGAGGCT AAAACTAGAG CCCAGGGGCC CCAAGTTCCA GACTGCCCCT 540 

CCCCCCTCCC CCGGAGCCAG GGAGTGGTTG GTGAAAGGGG GAGGCCAGCT GGAGAACAAA 600 

CGGGTAGTCA GGGGGTTGAG GATTAGAGCC CTTGTACCCT ACCCAGGAAT GGTTGGGGAG 660 

GAGGAGGAAG AGGTAGGAGG TAGGGGAGGG GGCGGGGTTT TGTCACCTGT CACCTGCTCG 720 

CTGTGCCTAG GGCGGGCGGG CGGGGAGTGG GGGGACCGGT ATAAAGCGGT AGGCGCCTGT 780 

GCCCGCTCCA CCTCTCAAGC AGCCAGCGCC TGCCTGAATC TGTTCTGCCC CCTCCCCACC 840 

CATTTCACCA CCACCATG 858 
(2) INFORMATION FOR SEQ ID NO: 4: 
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(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH f 1581 base pairs 

(B) TYPEs nucleic acid 

(C) STRANDED NESS : double 

(D) TOPOLOGY t linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE * NO 

<vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 



(xi) SEQUENCE DESCRIPTION J S 


EQ ID NO: 4: 






CATGGTGTCC GACTTATGCC 


CGAGAAGATG TTGAGGAAAC TTATCGCTTA TCTGCTTCTC 


60 


ATAGAGTCTT GCAGACAAAC 


TGCGCAACTC 


GTGAAAGGTA GGCGGATCTG 


GGTCGACCTG 


120 


CAGGTCAACG GATCCCTTCT 


TGACCAGTAT 


AGCTGCATTC TTGGCTGGGG 


CATTCCAACT 


180 


AGAACTGCCA AATTTAGCAC ATAAAAATAA 


GGAGGCCCAG TTAAATTTGA 


ATTTCAGATA 


240 


AACAATGAAT AATTTGTTAG 


TATAAATATG 


TCCCATGCAA TATCTTGTTG 


AAATTAAAAA 


300 


AAAAAGTCTT CCTTCCATGC 


CCCACCCCTA 


CCACTAGGCC TAAGGAATAG 


GGTCAGGGGC 


360 


TCCAAATAGA ATGTGGTTGA GAAGTGGAAT TAAGCAGGCT AATAGAAGGC AAGGGGCAAA 


420 


GAAGAAACCT TGAATGCATT 


GGGTGCTGGG 


TGCCTCCTTA AATAAGCAAG 


AAGGGTGCAT 




TTTGAAGAAT TGAGATAGAA 


GTCTTTTTGG 


GCTGGGTGCA GTTGCTCGTG 


GTTGTAATTC 


540 


CAGCACTTTG GGAGGCTGAG 


GCGGGAGGAT 


CACCTGAGGT TGGGAGTTCA 


AGACCAGCCT 


600 


CACCAACGTG GAGAACCCTG 


TCTTTACTAA 


AAATACAAAA AATTCAGCTG 


GTCATGGTGG 


660 


CACATG CCTG TAATCCCAGC 


TGCTCGGGAG 


GCTGAGGCAG GAGAATCACT 


TGAACCAGGG 


720 


AGGCAGAGGT TGTGGTGAGC 


AGAGATCGCG 


CCATTGCTCT CCAGCCTGGG 


CAACAAGAGC 


780 


AAAAGTTCGT TTAAAAAAAA AAAAAAGTCC TTTCGATGTG ACTGTCTCCT 


CCCAAATTTG 


840 


TAGACCCTCT TAAGATCATG 


CTTTTCAGAT 


ACTTCAAAGA TTCCAGAAGA 


TATGCCCCGG 


900 


GGGTCCTGGA AGCCACAAGG 


TAAACACAAC 


ACATCCCCCT CCTTGACTAT 


CAATTTTACT 


960 


AGAGGATGTG GTGGGAAAAC 


CATTATTTGA 


TATTAAAACA AATAGGCTTG 


GGATGGAGTA 


1020 


GGATGCAAGC TCCCCAGGAA 


AGTTTAAGAT 


AAAACCTGAG ACTTAAAAGG 


GTGTTAAGAG 


1080 


TGGCAGCCTA GGGAATTTAT 


CCCGGACTCC 


GGGGGAGGGG GCAGAGTCAC 


CAGCCTCTGC 


1140 


ATTTAGGGAT TCTCCGAGGA 


AAAGTGTGAG 


AACGGCTGCA GGCAACCCAG 


GCGTCCCGGC 


1200 


GCTAGGAGGG ACGACCCAGG 


CCTGCGCGAA 


GAGAGGGAGA AAGTGAAGCT 


GGGAGTTGCC 


1260 


GACTCCCAGA CTTCGTTGGA 


ATGCAGTTGG 


AGGGGGCGAG CTGGGAGCGC 


GCTTGCTCCC 


1320 


AATCACAGGA GAAGGAGGAG 


GTGGAGGAGG 


AGGGCTGCTT GAGGAAGTAT 


AAGAATGAAG 


1380 


TTGTGAAGCT GAGATTCCCC 


TCCATTGGGA 


CCGGAGAAAC CAGGGGAGCC 


CCCCGGGCAG 


1440 
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CCGCGCGCCC CTTCCCACGG GGCCCTTTAC TGCG CCGC6C GCCCGGCCCC CACCCCTCGC 1500 
AGCACCCCGC GCCCC6C6CC CTCCCAGCCG GGTCGAGCCG GAGCCATGGG GCCGGAGCCG 1560 
CAGTGAGCAC CATGGAGCTG G 1581 
(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS t 

(A) LENGTHS 1305 base pairs 

(B) TYPEs nucleic acid 

(C) STRANDEONESSz double 

(D) TOPOLOGY x linear 

(ii) MOLECULE TYPEs DNA (genomic) 

(iii) HYPOTHETICAL S NO 

(iii) ANTI- SENSE* NO 

(vi) ORIGINAL SOURCES 

(A) ORGANISMS Homo sapiens 

(ix) FEATURES 

(A) NAME /KEYS CDS 

(B) LOCATIONS 1204. .1284 



(xi) SEQUENCE DESCRIPTIONS SEQ ID NO; 5 s 



GGATCCGTCC 


CGGGACTAGC 


AGGGCTTTGG 


GCAGCAACCC 


GCAGGAGCCC 


GACCGCCTGT 


60 


GGCCAGGTCC 


GGGCAGCTGG 


TGGGGGAGGT 


TCCAGAGGTC 


CACGCCATTC 


GTGGACGCAG 


120 


TCTCTAGTGT 


CCTCTCCGCG 


TCCCACTTCA 


CTGCCCCATC 


CCCTTTCCTG 


CGAGAGCCTG 


180 


GACTTGGAAG 


GCACCTGGGA 


GGGTGTAAGC 


GCCTTGGTGT 


GTGCCCATCT 


GGGTCCCCAG 


240 


AAGAGCGGCG 


GGAACTGCGG 


CCGCCCGGAC 


GGTGCGGCCA 


GACTCCAGTG 


TGGAAGGGGA 


300 


GGCAGCTGTT 


CTCCCAGGCG 


GCCGTGGGGG 


GCAGCAGAGG 


GGACGGCGAC 


AGGTGCGGGA 


360 


GCCCCTCCCG 


GGGTAGAAGT 


GGAAAGGCGG 


GCTCCGGGGT 


CTGTT CCCAG 


GCTGGAAACC 


420 


ACCCCCGCCC 


CCCATCCAAA 


TCCCCGGGAG 


AGGCCCGGCC 


GGCGCCGGGT 


CTGGAGGAGG 


480 


AAGCGGCCAG 


AG ACAGTG CA 


ATTTCACGCG 


GTCTCTGTGG 


CTCGGGTTCC 


TGGGCTGGGT 


540 


GGATGAATTA 


TGGGGTTTCG 


AGTCTGGGAG 


AAACTGAGGT 


GGCCTGGACG 


TGAGGCAAAA 


600 


AACACCCTCC 


CCCTCAAAAA 


CACACAGAGA 


GAAATATTCA 


CATTCTGAGA 


GAAAATCCAC 


660 


CAAGTGAACC 


AACCGGCTAG 


GGGAGTTGAG 


TGATTTGGTT 


AATGGGCGAG 


GCCAACTTTC 


720 


AGGGGGCAGG 


GCTTTGGAGA 


GCTTTCCACT 


CCCTCATTCA 


TTACCCTTCC 


CTGGATCTGG 


780 


GGGCTTTCGG 


AATCTCGACC 


TCCCCTTGGC 


CTATCTCCTG 


CAGAAAAATT 


AGGGTGAGCC 


840 


CCATCCTCGA 


TCTGCTCCGC 


CAAGTTGCGG 


GACCGCGGGG 


CGTGGCACGC 


TCAGGGGCAG 


900 


GCGGTCCGAG 


GCTCCGCAAT 


CCCCACTCCA 


GCCTCGCGCG 


GGAGGGGGCG 


CGGCCCGTGT 


960 


GACTCACCCC 


CTTCCCTCTG 


CGTTCCTCCC 


TCCCTCTCTC 


TCTCTCTCTC 


ACACACACAC 


1020 


ACCCCTCCCC 


TGCCATCCCT 


CCCCGGACTC 


CGGCTCCGGC 


TCCGATTGCA 


ATTTGCAACC 


1080 
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TCCGCTGCCG TCGCCGCAGC AGCGACGAAT TCGCGAGCGG TTCAGGTGGC TCTTGCCTCG 1140 

ATGTCCTAGC CTAGGGGCCC CCGGGCCGGA CTTGGCTGGG CTCCCTTCAC CCTCTGCGGA 1200 

GTC ATG AG6 GCO AAC GAC GCT CTG CAG GTG CTG GGC TTG CTT TTC AGO 1248 
Met Arg Ala Asn Asp Ala Leu Gin Val Leu Gly Leu Leu Phe Ser 
1 5 10 15 

CTG GCC CGG GGC TCC GAG GTG GGC AAC TCT CAG GCA GGTAAGTGCC 1294 
Leu Ala Arg Gly Ser Glu Val Gly Asn Ser Gin Ala 
20 25 

CAGAGAGCAC C 1305 

(2) INFORMATION FOR SEQ ID NO: 6t 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 6: 

Met Arg Ala Asn Asp Ala Leu Gin Val Leu Gly Leu Leu Phe Ser Leu 
1 5 10 15 

Ala Arg Gly Ser Glu Val Gly Asn Ser Gin Ala 
20 25 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4752 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(ili) ANTI-SENSE S NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 



AATTCATGCC CCAGTTGACA ACATAGCTGG TTCAAGACAA ACCTGAGCAC 


TTTTCATCAC 


60 


TGAAATCTTC ACTCTGGACC 


AATCAACATT 


CATACATTCC 


CTTCTTTACT 


TTAACACTCT 


120 


CCTGAGAGCT ATTTCTCTTC 


TCATCCTAAT 


TCTCTGCTCA 


TATCACATTT 


CAGCAGCTTA 


180 


CATATGAAAA TCTTGTACAT 


TCCCATGAGA 


TTGCATTGAA 


ATTGCTTCAA 


CCCTTTTCTA 


240 


TGTCCATATG TATACCTTAC 


TTCTCATTCC 


TATTGCTTTG 


TAGCTACAGA 


AGCCTTAGCA 


300 


GTCTTTAGAA ATCTTGGGAG 


TGTTATGCCT 


CTTCCACTTA 


AAGCTTTAAT 


TCAAGAACTC 


360 


ATTCTCTAGA AGTTTAAACA 


ACTGCTATTC 


TGCTCTTCTA 


TGACTCTTTA 


ACATGTCTCT 


420 


CAAAATATGT TTTCTCCCAG 


AAAACTCTCC 


TCAACTTTCT 


TCAAAATTAA 


TAG ACTACAT 


480 
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TTTGCCCCAC AGAAATTCTT CTATAGGAAC TTCAATTTTT TTATTTAATC CAGGAAAACA 540 

TATTTAAACA TACTTAGCTA TAAATATTAA GATTAAATTG TGATAAAAAC TTATTTTTGA 600 

CAGTGGTGGA AACTGTCCTA TTAAATTTAC CATATCATTC CAATTTTATC CTCAGAAAAT 660 

GCCCAATAGA TGATCAATAA ATATTTGCAG TATACTTATT GCATTCCCAG CTATAGCTAA 720 

TATAGATTGC ATTAAGACTG TCTTAGGGTA GACAATAGTC AAACAGAATT GGATAGGGCA 780 

TATTCAAATT CTGTAGCTAC CAAATGTGTA TACAAGGGAA AAATGTGTAA ATGAATACTT 840 

ACATAGAAAG AGCAGGACTG GTGATAAGAT TGTCATGTAA ATAAGTTGGT GGAGACTTGA 900 

GTTACATTGC AGAGGAATAA AGAGAGCTTT TAATTTAAGA GAATCAGTCC TATGAAGAAA 960 

ACAAAGACAG CAAACTACAC TTCAGTCATT CTCTCATGAG GCTCTACAGA TTTTGAAAAT 1020 

GAGCATGGGA AAGCCTTAGA AAATGAGCAT GGGAAAGCCC TTAGAAAGAG TGATGAGGAT 1080 

TTTCTGAAAT TTTGCTAAGA TGCCTGATAG TAGAACAAGG GAAAGAAAAA GTTGTTAATT 1140 

AAGTTTAACA GGAGAAGCTG GATTCCACCT GCAACTAGGT GGGAAAGAGT TACAAAAGOC 1200 

CCCATGTAGC TACAAGTAAT ACATAACAAA AAATCCCCAG GGGAAACGAG GTCCTGTGTT 1260 

AATCTACTAT GGGCTTTAAG AGAAGAAAAA CAAAAAAGGA CTAAGACTCT GAAGGAAATC 1320 

ATGTGAGAGC TTCCTTATTC CAGCAAGAGA CAAAATCTCC ATGGAAACTT CTCTTTCCTG 1380 

CACCCACACA CTGTTCTTTC TACCTCGCAA GGCTGCCTTG AATCTCAAGA GAATCCTATG 1440 

GAGCAGTCAA CACATTTTAA ATACTGAAAC AAACCCTAGG GAAGAAAAGA AGCAGAGCAG 1500 

GCTGACATTC CAGCATTATC AGGAAAGCAA TGATTTTCCT AGATTTCCGC AGCCCCAGTG 1560 

TTCAGATAAA GGGTTTCCTC AAACCTTCAC TTCCTTTCTC CTACAGTATA AATTAAAAGA 1620 

ACCACTCAGC TTTTAGTATG AAGCAGCATA GAGAAGGGAG TTCCAAAGAG AOGTTTGTTT 1680 

CTTGACCATT CTCATTATCC TTCTTTCATG GAGCAGTGCT ATTCAAACCA TCCAGTAAGT 1740 

CCATTACTCA CTTCCACATT TTATGAGCAA AATAATAAAA GAGAGATAGA GTAAGAATGA 1800 

AGGAGAGAGA GAGAGAGAGA GAGAGAGAGA GAGAGAGAGA GAGAGAGAGA GAGAGGAAGA 1860 

GAGAGAGAGA GAGAGAGAGA GAGAGAGAGA GAGAGAGAGA GAGAGAGAGA GAGAGAGAGG 1920 

TOTTTTCTTA ACTAGAAACT TTATGCATTG AAGCAGTTCA CCAAAATAAC AAAGTAACAA 1980 

AGTAAGATAT CTTTGGAATA ATCAATTCAA GATAATCAAG GAAAAATGAG AGGCAACTAT 2040 

TTTAGACTGA TTACTTTTAT AAAATAAATA AGCTCAGCTT AG CC AG AT AT AAGCAATATT 2100 

CTGAGTTCTG AAGAAAAATT TTTGACAAAA TGAGTTCTAT AAATGTTATT GTCTACTTAT 2160 

GATCTCTAAA TACAACAGGC TTGTATTCAG AATCTAGATG TTTCATGACC TTTATTCATA 2220 

AGAGATGATG TATTCTTGAT ACTACTTCTC ATTTGCAAAT TCCAATTATT ATTAATTTCA 2280 

TATCAATTAG AATAATATAT CTTCCTTCAA TTTAGTTACC TCACTATGGG CTATGTACAA 2340 

ACTCCAAGAA AAAGTTAGTC ATGTGCTTTG CAGAAGATAA AAGCTTAGTG TAAAACAGGC 2400 

TGAGAGTATT TGATGTAAGA AGGGGAGTGG TTATATAGGT CTTAGCCAAA ACATGTGATA 2460 
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w X X wV*nW 


wVJU X X VvUlljVj 


AAAAGAAGTC 


TGTGACACTC ATTAACCTAT 


TGGTGCA6AX 




x x a w x ax uni 


W X AAA vuA \9n 


AAATGTTCTT 


GGCTGTTTTG TATTGCCTTC 


TGTGGAGTTT 


neon 




WAA wv*>v«inx X 


TTCCTCGAGC 


CTGTGCCTCC TCTAAGAACT 


Xvx IwOWinn 


ZD4U 


AGAATGCTGC 


CCACCATGGA 


TGGGTGATGG GAGTCCCTGC GGCCAGCTTT 


Wiwu Lnunuu 


« /uu 


TTCCT6CCAG 


GATATCCTTC 


TGTCCAGTGC 


ACCATCTGGA CCTCAGTTCC 


CCx*x*CAAAGG 


Of fin 

a /DU 


GGTGGATGAC 


CGTGAGTCCT 


GGCCCTCTGT 


GTTTTATAAT AGGACCTGCC 


AGTGCTCAGG 




CAACTTCATG 


GGTTTCAACT 


GCGGAAACTG 


TAAGTTTGGA TTTGGGGGCC 


CAAATTGTAC 


noon 


AGAGAAGCGA GTCTTGATTA 


GAAGAAACAT 


TTTTGATTTG AGTGTCTCCG 


AAaaGAATaa 


Z94U 


GTTCTTTTCT 


TACCTCACTT 


TAGCAAAACA 


TACTATCAGC TCAGTCTATG 


TCAXCCCCaC 


^ Ann 
3QOU 


AGGGACCTAT GGCGAAATGA 


ACAATGGGTC 


AACACCCATG TTTAATGATA 


TCAACATCTA 


3060 


CGACCTCTTT 


GTATGGATGC 


ATTACTATGT 


GTGAAGGGAC AGACTGCTTG 


GGGGCTCTGA 


3X20 


AATATGGAGG 


GACATTGATT 


TTGCCCATGA 


AGCACGAGGG TTTCTGCCTT 


GGCACAGACT 


3180 


TTTCTTGTTA 


TTGTGGGAAC 


AAGAAATTCG 


AGAACTAACT GGGGATGAGA 


ACTTCACTGT 


3240 


TCCATACTGG 


GATTGGAGAG 


ATGCAGAAAA 


CTGTGACATT TGCACAGATG 


AGTACTTGGG 


3300 


AGGTCGTCAC 


CCTGAAAATC 


CTAACTTACT 


CAGCCGAGCA TCCTTCTTCT CCTCCXGGCA 


3360 


GGTAAGATGC 


ACTATATAGA 


GAGAGTTGCA AAGACTGGTA CTTCAGCAGC 


CACATTTTCA 


3420 


TGCTCTGTGA 


GCATCTCXGA TAATATCTCA GGGCAGAAAA TGTGCCTTAC TAACAGAXGT 


3480 


TAATGCTTCT 


TGATTTCTTT 


TTCTCTTTTG 


AGAACTCTTC AAAGTGTTAT 


TAAACAAATA 


3540 


TCTATGTGCT TATTTGTCTT AATATCTAAC AGCTTAGTTA GATTTCTAAG 


CTGCTATAAA 


3600 


CAAGGACTGA 


TTGGTTCACC 


ACTGTATTGT 


TAGCACCTCC TATGTATCTA 


ATAACAGTAA 


3660 


CTCAGTTATT 


AAGAATGGAT 


AGAAACCAGA 


TTATCTTAGT TCAATTTCTA 


GTAATATTAA 


3720 


ACTTAATATA 


ACAGTAAATC 


CATAAGTATC 


TACTTAAAAT ATAATCTCTG GCCAAACCAA 


3780 


GACTTATTAT 


TAGGATCTTC 


AAGAGAAAGT 


GCTGAGATAA TTCACTAAGT 


ATCAGAGATG 


3840 


ACCTTTATTA CATGATTGCC TGATAGAAAA AATGATTACA GACACACAAA AAAATCTTCA 


3900 


GTTGCTTAAT 


TTAAGCGCTG 


ACTCTCAACA 


GTTAAGTAAT AAAAGAGTTA 


AGCCTGCTOT 


3960 


GTATTTAGAA 


TATGTGAATA 


CCTATTGAAA 


GAATTTATTG TACAATTAAT ATAAACAGAC 


4020 


TTCTATTTTA 


CATCATAAGA 


TACTACTTAA 


TTTGTTAAAA ATTATTTTTT 


ATACATTGTT 


4080 


GTAAATACAA 


AGTGATATTT 


CTAATGATTA 


CAAGGCTGTC TGGCTAACTT 


ACGTTATGTT 


4140 


CAGGAGAAGA 


CAGTCCTTTT 


TAAGGAATGG 


GCACTTTCTA ACTTTTTTTC 


TCTAGGATGG 


4200 


AGAAAAATTA 


6CCTTCTTCC 


TACTTTAAAA ATGTTAGACA TAGAATTAAG 


GGATTGTTAT 


4260 


TTTGAGATTA 


AATTTTCTTT 


TCTCCTATTA 


TTTTTCCTCA TTCTGGAATG 


GAAGCAAAAG 


4320 


ATGAAGAAAG 


AAATATATGT 


TAAATTGTTT 


TCCTTTAAAT GAACACAAAT 


GTGAAATATG 


4380 


TTTTTCTGCC 


TATCTTGTAA 


AATTTTCTAT 


TGCAACTATT CTGATTACCA 


GTTCAAATGG 


4440 
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GGAAAAAAGA ACATAGGCTA CCCGAGACTT GAAATTTTGA AATATGAATG TCCTCTGTCT 4500 

CTAGCTGAGT ACTCTGGCGC TTCCAAAATG GAAACCTTTA AAGGGCCACT GTAAATTACA 4560 

GCTGCTAATT CCTGGTGCCA ATGGTGATAA GTGTTTACTA AACCTAGTGA GTACTTTATA 4620 

GCATGGGTCT GCTGCGAA6T AACATTGCTG TATATTTTCA GTCATTCTAC CTTAATTCAT 4680 

GAACTGCAAA ACTCTCATCT AG CTTT T T AC TTCTCTAGCT ATTGCTTTAA GTTCTATCAG 4740 

GCTCAGGTGT GG 4752 
(2) INFORMATION FOR SEQ ID NO: 8: 

(i> SEQUENCE CHARACTERISTICS t 

(A) LENGTH: 1236 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESSs double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPEs DNA (genomic) 
(iii) HYPOTHETICAL I NO 
(iii) ANTI-SENSE X NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 



AAGCTTTGTA 


GAGTAATCAT 


GTATTCCAAA 


CTCAGGCTTA 


CATTTGAATG 


TTGGCTACAT 


60 


ATGTATGAGT 


TTTCAACTTC 


CAGGAGAAAA 


CGTCTCTTTA 


AAAGAGAACA 


ACCAAAAGCT 


120 


AACAGAAAAT 


ACAAGTGTGA 


CATTGGCCTT 


AGTTCGACCA 


AGAAAGCAAT 


TCATCTTGTT 


180 


TCTTCCTTTG 


TGGTATACAG 


ATAAGAAAAA 


TAAAATCACT 


ACAACGAAAG 


CAAAATCTCT 


240 


TCAGCGTCTC 


TAATACATCT 


TCCAAATCAG 


TGTGTCTGAC 


CTTTTCTTAA 


GACTTTAACC 


300 


ATCACAAGGA 


AACCAGTGGG 


GAGGGAGTCA 


TGTGCTGCCT 


AGTAGTTAAA 


GGGCAGGAGA 


360 


ATTCACTGGT 


GTGAGAAGGG 


ATTAGTGAGA 


GCTGGAAGAG 


AGGACCAGCC 


CCTCCCAGTG 


420 


TGAGGAATCT 


GGCTTGGGAT 


TTACTGTCTG 


GCAGAAAATC 


TCTTCGGGCA 


ATTAACAGCT 


480 


GGCATCAGGG 


GAAAAGCAGA 


CATCCAACAA 


CACTAGCTCT 


GAAGGAGATC AGCAGAGAAA 


540 


CCTTCCAGGG 


ATTCATGGTA 


CTGGTGAGCA 


GCTCTGTGGT 


GGGTACCCTT 


GTGACCAAAG 


600 


CTCTAGGAAC 


ATGAAGGAGA 


TTTGCTTGCT 


ATAAACCTGT 


TTCCTATTCT 


CCTTTCATTT 


660 


CCATGGTTAA 


CTATTACTAT 


GGTAGTCACC 


AACTAGTGGA 


TGCTTTTGGT 


AAATGACATC 


720 


TATGGAAAGT 


CTTTTTGGAT 


CAGGGTGATC 


TTTTTATGTA 


TGTGTATGTG 


CATGGATATG 


780 


GGTGCACGAG 


AGCAGGTGCC 


CAGATTCTCA 


AGGAGGGCTT 


CAGTTACAAG 


GAGTTGGGAG 


840 


TGATCTGATG 


TGGTTGCAAG 


GCACTGAAGT 


CAGTCTCTCT 


GTAAGAGCAC 


TCTATGCTCC 


900 


TTACCACTGT 


GCCTTCTCCC 


CAGCCCAAG A 


ATAGTATTCT 


TATGGGTAGA 


AATTTAAATA 


960 


AGAAACTCAA 


AGACCAGGAG 


AGTGAGTTCT 


GTCATCTAGC 


TATTATGCCT 


GCAGATATTT 


1020 


AAAGGTGAAT 


AATTATTTTG 


ACTATTGTTT 


AGAAATGTTG 


TTTCACATGA 


AAGATTCCAT 


1080 
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TTCCGGAGTG GGTTGAAAAG TATGCAAAAG AACTTTTGCA ACTCTGTTTT TGCCTTTCTG 1140 
TTTTTCAGCT GTATTTTCAT CTGAGCACCC CTGTCTTCTC CATGCAAAGA GCAGCATAGG 1200 
AGACCTGTGT TCTGAACTCT TGCTTCGAGA AGAATG 1236 
(2) INFORMATION FOR SEQ ID NO* 9: 

(i) SEQUENCE CHARACTERISTICS I 

(A) LENGTH: 5737 base pairs 

(B) TYPBs nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE l DNA (genomic) 
(iii) HYPOTHETICAL x NO 
(iii) ANTI-SENSE * NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

TTAATCAACA AATCTAAACA TTTATTCTTT TCATCTGTTT ACTCTTGCTC TTGTTCACCA 60 

CAATATGCTA TTCACATGTT CAGTGTAGTT TTATGACAAA GAAAATTTTC TGAGTTACTT 120 

TTGTATCCCC ACCCCCTTAA AGAAAGGAGG AAAAACTGTT TCATACAGAA GGCGTTAATT 180 

GCATGAATTA GAGCTATCAC CTAAGTGTGG GCTAATGTAA GAAAGAGGGA TTTCACCTAC 240 

ATCCATTCAG TCAGTCTTTG GGGGTTTAAA GAATTCCAAA GAGTCATCAG AAGAGGAAAA 300 

ATGAAGGTAA TGTTTTTTCA GACAGGTAAA GTCTTTGAAA ATATGTGTAA TATGTAAAAC 360 

ATTTTGACAC CCCCATAATA TTTTTCCAGA ATTAACAGTA TAAATTGCAT CTCTTGTTCA 420 

AGAGTTCCCT ATCACTCTCT TTAATCACTA CTCACAGTAA CCTCAACTCC TGCCACAATG 480 

TACAGGATGC AACTCCTGTC TTGCATTGCA CTAAGTCTTG CACTTGTCAC AAACAGTGCA 540 

CCTACTTCAA GTTCTACAAA GAAAACACAG CTACAACTGG AGCATTTACT GCTGGATTTA 600 

CAGATGATTT TGAATGGAAT TAATGTAAGT ATATTTCCTT TCTTACTAAA ATTATTACAT 660 

TTAGTAATCT AGCTGGAGAT CATTTCTTAA TAACAATGCA TTATACTTTC TTAGAATTAC 720 

AAGAATCCCA AACTCACCAG GATGCTGACA TTTAAGTTTT ACATGCCCAA GAAGGTAAGT 780 

ACAATATTTT ATGTTCAATT TCTGTTTTAA TAAAATTCAA AGTAATATGA AAATTTGCAC 840 

AGATGGGACT AATAGCAGCT CATCTGAGGT AAAGAGTAAC TTTAATTTGT TTTTTTGAAA 900 

ACCCAAGTTT GATAATGAAG CCTCTATTAA AACAGTTTTA CCTATATTTT TAATATATAT 960 

TTGTGTGTTG GTGGGGGTGG GAAGAAAACA TAAAAATAAT ATTCTCACCT TTATCGATAA 1020 

GACAATTCTA AACAAAAATG TTCATTTATG GTTTCATTTA AAAATGTAAA ACTCTAAAAT 1080 

ATTTGATTAT GTCATTTTAG TATGTAAAAT ACCAAAATCT ATTTCCAAGG AGCCCACTTT 1140 

TAAAAATCTT TTCTTGTTTT AGGAAAGGTT TCTAAGTGAG AGGCAGCATA ACACTAATAG 1200 

CACAGAGTCT GGGGCCAGAT ATCTGAAGTG AAATCTCAGC TCTGCCATGT CCTAGCTTTC 1260 
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ATGATCTTTG GCAAATTACC TACTCTGTTT GTGATTCAGT TTCATGTCTA CTTAAATGAA 1320 

TAACTGTATA TACTTAATAT GGCTTTGTGA GAATTAGTAA GTTAAATGTA AAGCACTCAG 1380 

AACCGTGTCT GGCATAAGGT AAATACCATA CAAGCATTAG CTATTATTAG TAGTATTAAA 1440 

GATAAAATTT TCACTGAGAA ATACAAAGTA AAATTTTGGA CTTTATCTTT TTACCAATAG 1500 

AACTTGAGAT TTATAATGCT ATATGACTTA TTTTCCAAGA TTAAAAGCTT CATTAGGTTG 1560 

TTTTTGGATT CAGATAGAGC ATAAGCATAA TCATCCAAGC TCCTAGGCTA CATTAGGTGT 1620 

GTAAAGCTAC CTAGTAGTTG TGCCAGTTAA GAGAGAATGA AGAAAATCTG GTGCCAGAAA 1680 

GAGCTTGTGC CAGGGTGAAT CCAAGCCCAG AAAATAATAG GATTTAAGGG GACACAGATG 1740 

CAATCCCATT GACTCAAATT CTATTAATTC AAGAGAAATC TGCTTCTAAC TACCCTTCTO 1800 

AAAGATGTAA AGGAGACAGC TTACAGATGT TACTCTAGTT TAATCAGAGC CACATAATGC 1860 

AACTCCAGCA ACATAAAGAT ACTAGATGCT GTTTTCTGAA GAAAATTTCT CCACATTGTT 1920 

CATGCCAAAA ACTTAAACCC GAATTTGTAG AATTTGTAGT GGTGAATTGA AAGCGCAATA 1980 

GATGGACATA TCAGGGGATT GGTATTGTCT TGACCTACCT TTCCCACTAA AGAGTGTTAO 2040 

AAAGATGAGA TTATGTGCAT AATTTAGGGG GTGGTAGAAT TCATGGAAAT CTAAGTTTGA 2100 

AACCAAAAGT AATG ATAAAC TCTATTCATT TGTTCATTTA ACCCTCATTG CACATTTACA 2160 

AAAGATTTTA GAAACTAATA AAAATATTTG ATTCCAAGGA TGCTATGTTA ATGCTATAAT 2220 

GAGAAAGAAA TGAAATCTAA TTCTGGCTCT ACCTACTTAT GTGGTCAAAT TCTGAGATTT 2280 

AGTGTGCTTA TTTATAAAGT GGAGATGATA CTTCACTGCC TACTTCAAAA GATGACTGTG 2340 

AGAAGTAAAT GGGCCTATTT TGGAGAAAAT TCTTTTAAAT TGTAATATAC CATAGAAATA 2400 

TGAAATATTA TATATAATAT AGAATCAAGA GGCCTGTCCA AAAGTCCTCC CAAAGTATTA 2460 

TAATCTTTTA TTTCACTGGG ACAAACATTT TTAAAATGCA TCTTAATGTA GTGATTGTAG 2520 

AAAAGTAAAA TTTAAGACAT ATTTAAAAAT GTGTCTTGCT CAAGGCTATA TTGAGAGCCA 2580 

CTACTACATG ATTATTGTTA CCTAGTGTAA AATGTTGGGA TTGTGATAGA TGGCATTCAA 2640 

GAGTTCCTTC TCTCTCAACA TTCTGTGATT CTTAACTCTT AG ACTATCAA AT ATTATAAT 2700 

CATAGAATGT GATTTTTATG CTTCCACATT CTAACTCATC TGGTTCTAAT GATTTTCTAT 2760 

GCAGATTGGA AAAGTAATCA GCCTGCATCT GTGATAGGCA CTTACGATGC AGAAAGTCTA 2820 

ACATTTTGCA AAGCCAAATT AAGCTAAAAC CAGTGAGTCA ACTATCACTT AACGCTAGTC 2880 

ATAGGTACTT GAGCCCTAGT TTTTCGAGTT TTATAATGTA AACTCTACTG GTCCATTCTT 2940 

TACAGTGACA TTGAGAACAG AGAGAATGGT AAAAACTACA TACTGCTACT CCAAATAAAA 3000 

TAAATTGGAA ATTAATTTCT GATTCTGACC TCTATGTAAA CTGAGCTGAT GATAATTATT 3060 

ATTCTAGGCC ACAGAACTGA AACATCTTCA GTGTCTAGAA GAAGAACTCA AACCTCTGGA 3120 

GGAAGTGCTA AATTTAGCTC AAAGCAAAAA CTTTCACTTA AGACCCAGGG ACTTAATCAG 3180 

CAATATCAAC GTAATAGTTC TGGAACTAAA GGTAAGG CAT TACTTTATTT GCTCTCCTGG 3240 
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AAATAAAAAA AAAAAAGTCA GGGGGAAAAG TACCACATTT TAAAGTGACA TAACATTTTT 3300 
GCTATTTGTA AAGTACCCAT GCATGTAATT AGCCTACATT TTAAGTACAC TGTGAACATG 3360 
AATCATTTCT AATGTTAAAT GATTAACTGG GGAGTATAAG CTACTGAGTT TGCACCTACC 3420 
ATCTACTAAT GGACAAGCCT CATCCCAAAC . TCCATCACCT TTCATATTAA CACAAAACTG 3480 
GGAGTGAGA6 AAGGTACTGA GTTGAGTTTC ACAGAAAGGA GGCAGATTTT ATTATATATT 3540 
TTTCGATTCT TCAGATCATT TACTGAAATA GCCAATACTG ATTACCTGAA AGGCTTTTCA 3600 
AATGGTGTTT CCTTATCATT TGATGGAAGG ACTACCCATA AGAGATTTGT CTTAAAAAAA 3660 
AAAACTGGAG CCATTAAAAT GGCCAGTGGA CTAAACAAAC AACAATCTTT TTAGAGGCAA 3720 
TCCCCACTTT CAGAATCTTA AGTATTTTTA AATGCACAGG AAGCATAAAA TATGCAAGGG 3780 
ACTCAGGTGA TGTAAAAGAG ATTCACTTTT GTCTTTTTAT ATCCCGTCTC CTAAGGTATA 3840 
AAATTCATGA GTTAATAGGT ATCCTAAATA AGCAGCATAA GTATAGTAGT AAAAGACATT 3900 
CCTAAAAGTA ACTCCAGTTG TGTCCAAATG AATCACTTAT TAGTGGACTG TTTCAGTTGA 3960 
ATTAAAAAAA TACATTGAGA TCAATCTCAT CTAGACATTG ACAGATTCAG TTCCTTATCT 4020 
ATGGCAAGAG TTTTACTCTA AAATAATTAA CATCAGAAAA CTCATTCTTA ACTCTTGATA 4080 

CAAATTTAAG ACAAAACCAT GCAAAAATCT GAAAACTGTG TTTCAAAAGC CAAACACTTT 4140 

TTAAAATAAA AAATCCCAAG ATATGACAAT ATTTAAACAA TTATGCTTAA GAGGATACAG 4200 

AACACTGCAA CAGTTTTTTA AAAGAGAATA CTTATTTAAA GGGAACACTC TATCTCACCT 4260 

GCTTTTGTTC CCAGGGTAGG AATCACTTCA AATTTGAAAA GCTCTCTTTT AAATCTCACT 4320 

ATATATCAAA ATATTTCCTC CTTAGCTTAT CAACTAGAGG AAGCGTTTAA ATAGCTCCTT 4380 

TCAGCAGAGA AGCCTAATTT CTAAAAAGCC AGTCCACAGA ACAAAATTTC TAATGTTTAA 4440 

ACTTTTAAAA GTTGGCAAAT TCACCTGCAT TGATACTATG ATGGGGTAGG GATAGGTGTA 4500 

AGTATTTAGA AGATGTTCTT CACACAAATT TATCCCAAAC GGAAGCATGT CCTAGCTTAC 4560 

TCTAGTGTAG TTCTGTTCTG CTTTGGGGAA AATATAAGGA GATTCACTTA AGTAGAAAAA 4620 

TAGGAGACTC TAATCAAGAT TTAGAAAAGA AGAAAGTATA ATGTGCATAT CAATTCATAC 4680 

ATTTAACTTA CACAAATATA GGTGTACATT CAGAGGAAAA GCGATCAAGT TTATTTCACA 4740 

TCCAGCATTT AATATTTGTC TAGATCTATT TTTATTTAAA TCTTTATTTG CACCCAATTT 4800 

AGGGAAAAAA TTTTTGTGTT CATTGACTGA ATTAACAAAT GAGGAAAATC TCAGCTTCTG 4860 

TGTTACTATC ATTTGGTATC ATAACAAAAT ATGTAATTTT GGCATTCATT TTGATCATTT 4920 

CAAGAAAATG CGAATAATTA ATATGTTTGG TAAGCTTGAA AATAAAGGCA ACAGGCCTAT 4980 

AAGACTTCAA TTGGGAATAA CTGTATATAA GGTAAACTAC TCTGTACTTT AAAAAATTAA 5040 

CATTTTTCTT TTATAGGGAT CTGAAACAAC ATTCATGTGT GAATATGCTG ATGAGAGAGC 5100 

AACCATTGTA GAATTTCTGA ACAGATGGAT TACCTTTTGT CAAAGCATCA TCTCAACACT 5160 

GACTTGATAA TTAAGTGCTT CCCACTTAAA ACATATCAGG CCTTCTATTT ATTTAAATAT 5220 
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TTAAATTTTA TATXTATT6T TGAATGTATG GTTTGCTACC TATTGTAACT ATTATTCTTA 5280 

ATCTTAAAAC TATAAATAT6 6ATCTTTTAT GA TT CT T TT T GTGCCCTAGO GGCTCTAAAA 5340 

TGGTTTCACT TATTTATCCC AAAATATTTA TTATTATGTT GAATGTTAAA TATAGTGCTA 5400 

TGTAGATTGG TTAGTAAAAC TATTTAATAA ^TTTGATAAA TATAAACAAG CCTGGATATT 5460 

TGTTATTTTG GAAACAGGAC AGAGTAAGCA TTTAAATATT TCTTAGTTAC TTGTGTGAAC 5520 

TGTAGGATGG TTAAAATGCT TACAAAAGTC ACTCTTTCTC TGAAGAAATA TGTAGAACAG 5580 

AGATGTAGAC TTCTCAAAAG CCCTTGCTTT GTCCTTTCAA GGGCTGATCA GACCCTTAGT 5640 

TCTGGCATCT CTTAGCAGAT TATATTTTCC TTCTTCTTAA AATGCCAAAC ACAAACACTC 5700 

TTGAAACTCT TCATAGATTT GGTGTGGCTA TGAATTC 5737 
(2) INFORMATION FOR SEQ ID NO: 10s. 

(1) SEQUENCE CHARACTERISTICS : 

(A) LENGTH t 614 baee pairs 

(B) TYPEt nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(li> MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL 5 NO 
(iii) ANTI-SENSE t NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10s 



GATCGTTAGC 


TTCTCCTGAT 


AAACTAATTG 


CCTCACATTG 


TCACTGCAAA 


TCGACACCTA 


60 


TTAATGGGTC 


TCACCTCCCA 


ACTGCTTCCC 


CCTCTGTTCT 


TCCTGCTAGC 


ATGTGCCGGC 


120 


AACTTTGTCC 


ACGGACACAA 


GTGCGATATC 


ACCTTACAGG 


AGATCATCAA 


AACTTTGAAC 


180 


AGCCTCACAG 


AGCAGAAGAC 


TCTGTGCACC 


GAGTTGACCG 


TAACAGACAT 


CTTTGCTGCC 


240 


TCCAAGAACA 


CAACTGAGAA 


GGAAACCTTC 


TGCAGGGCTG 


CGACTGTGCT 


CCGGCAGTTC 


300 


TACAGCCACC 


ATGAGAAGGA 


CACTCGCTGC 


CTGGGTGCGA 


CTGCACAGCA 


GTTCCACAGG 


360 


CACAAGCAGC 


TGATCCGATT 


CCTGAAACGG 


CTCGACAGGA 


ACCTCTGGGG 


CCTGGCGGGC 


420 


TTGAATTCCT 


GTCCTGTGAA 


GGAAGCCAAC 


CAGAGTACGT 


TGGAAAACTT 


CTTGGAAAGG 


480 


CTAAAGACGA 


TCATGAGAGA 


GAAATATTCA 


AAGTGTTCGA 


GCTGAATATT 


TTAATTTAT6 


540 


AGTTTTTGAT 


AGCTTTATTT 


TTTAAGTATT 


TATATATTTA 


TAACTCATCA 


TAAAATAAAG 


600 


TATATATAGA 


ATCT 










614 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1589 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(11) MOLECULE TYPE: cDNA to mRNA 
(111) HYPOTHETICAL: NO 
(111) ANTI-SENSE I NO 



(xl) SEQUENCE DESCRIPTION: SEQ ID NO 3 lis 

GAATTCCTCT GGTCCTCATC CA66T6C6C6 GGAAGCAG6T GCCGAGGAGA GAGGGGATAA 60 

TGAAGATTCC ATGCTGATGA TCCCAAAGAT TGAACCTGCA GACCAAGCGC AAAGTAGAAA 120 

CTGAAAGTAC ACTGCTGGCG GATCCTACGG AAGTTATGGA AAAGGCAAAG CGCAGAGCCA 180 

CGCCGTAGTG TGTGCCGCCC CCCTTCGGAT GGATGAAACT GCAGTCGCGG CGTGGGTAAG 240 

AGGAACCAGC TGCAGAGATC ACCCTGCCCA ACACAGACTC GGCAACTCCG CGGAAGACCA 300 

GGGTCCTGGG AGTGACTATG GGCGGTGAGA GCTTGCTCCT GCTCCAGTTG CGGTCATCAX 360 

GACTACGCCC tJCCTCCCGCA GACCATGTTC CATGTTTCTT TTAGGTATAT CTTTGGACTT 420 

CCTCCCCTGA TCCTTGTTCT GTTGCCAGTA GCATCATCTG ATTGTGATAT TGAAGGTAAA 480 

GATGGCAAAC AATATGAGAG TGTTCTAATG GTCAGCATCG ATCAATTATT GGAGAGGATG 540 

AAAGAAATTG GTAGCAATTG CCTGAATAAT GAATTTAACT TTTTTAAAAG ACATATCTGT 600 

GATGCTAATA AGGAAGGTAT GTTTTTATTC CGTGCTGCTC GCAAGTTGAG GCAATTT CT T 660 

AAAATGAATA GCACTGGTGA TTTTGATCTC CACTTATTAA AAGTTTCAGA AGGCACAACA 720 

ATACTGTTGA ACTGCACTGG CCAGGTTAAA GGAAGAAAAC CAGCTGCCCT GGGTGAAGCC 780 

CAACCAACAA AGAGTTTGGA AGAAAATAAA TCTTTAAAGG AACAGAAAAA ACTG AATGAC 840 

TTGTGTTTCC TAAAGAGACT ATTACAAGAG ATAAAAACTT GTTGGAATAA AATTTTGATG 900 

GGCACTAAAG AACACTGAAA AATATGGAGT GGCAATATAG AAACACGAAC TTTAGCTGGA 960 

TCCTCCAAGA ATCTATCTGC TTATGCAGTT TTTCAGAGTG GAATGCTTCC TAGAAGTTAC 1020 

TGAATGCACC ATGGTCAAAA CGGATTAGGG CATTTGAGAA ATGCATATTG TATTACTAGA 1080 

AGATGAATAC AAACAATGGA AACTGAATGC TCCAGTCAAC AAACTATTTC TTATATATGT 1140 

GAACATTTAT CAATCAGTAT AATTCTGTAC TGATTTTTGT AAGACAATCC ATGTAAGGTA 1200 

TCAGTTGCAA TAATACTTCT CAAACCTGTT TAAATATTTC AAGACATTAA ATCTATGAAG 1260 

TATATAATGG TTTCAAAGAT TCAAAATTGA CATTGCTTTA CTGTCAAAAT AATTTTATGG 1320 

CTCACTATGA ATCTATTATA CTGTATTAAG AGTGAAAATT GTCTTCTTCT GTGCTGGAGA 1380 

TGTTTTAGAG TTAACAATGA TATATGGATA ATGCGGGTGA GAATAAGAGA GTCATAAACC 1440 

TTAAGTAAGC AACAGCATAA CAAGGTCCAA GATACCTAAA AGAGATTTCA AGAGATTTAA 1500 

TTAATCATGA ATGTGTAACA CAGTGCCTTC AATAAATGGT ATAGCAAATG TTTTGACATG 15*60 

AAAAAAGGAC AATTTCAAAA AAATAAAAT 1589 
(2) INFORMATION FOR SEQ ID NO: 12: 
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(i) SEQUENCE CHARACTERISTICS t 

(A) LENGTH t 1585 base pairs 

(B) TYPE i nucleic acid 

(C) STRANDEDNESSx double 

(D) TOPOLOGY i linear 

(ii) MOLECULE TYPE: cDNA to roRNA 
(iii) HYPOTHETICAL: HO 
(iii) ANTI-SENSE X NO 

(xi) SEQUENCE DESCRIPTION t SEQ ID NO: 12 S 

CACACCCTGA CAAGCTGCCA G6CAGGTTCT CTTCCTCTCA CATACTGACC CACGGCTCCA 60 

CCCTCTCTCC CCT6GAAAG6 ACACCATGAO CACTGAAAGC ATGATCCGGG ACGTGGAGCT 120 

6GCC6AGGA6 GCGCTCCCCA AGAAGACAGG GGGG CCCCAG GGCTCCAGGC GGTGCTTGTT 180 

CCTCAGCCTC TTCTCCTTCC TGATCGTGGC AGGCGCCACC ACGCTCTTCT GCCTGCTGCA 240 

CTTTGGAGTG ATCGGCCCCC AGAGGGAAGA GTCCCCGAGG GACCTCTCTC TAATCAGCCC 300 

TCTGGCCCAO GCAGTCAGAT CATCTTCTCG AACCCCGAGT GACAAGCCTG TAGCCCATGT 360 

TGTAGCAAAC CCTCAAGCTG AGGGGCAGCT CCAGTGGCTG AACCGCCGGG CCAATGCCCT 420 

CCTGGCCAAT GGCGTGGAGC TGAGAGATAA CCAGCTGGTG GTGCCATCAG AGGGCCTGTA 480 

CCTCATCTAC TCCCAGGTCC TCTTCAAGGG CCAAGGCTGC CCCTCCACCC ATGTGCTCCT 540 

CACCCACACC ATCAGCCGCA TCGCCGTCTC CTACCAGACC AAGGTCAACC TCCTCTCTGC 600 

CATCAAGAGC CCCTGCCAGA GGGAGACCCC AGAGGGGGCT GAGGCCAAGC CCTGGTATGA 660 

GCCCATCTAT CTGGGAGGGG TCTTCCAGCT GGAGAAGGGT GACCGACTCA G CGCTGAG AT 720 

CAATCGGCCC GACTATCTCG ACTTTGCCGA GTCTGGGCAG GTCTACTTTG GGATCATTGC 780 

CCTGTGAGGA GGACGAACAT CCAACCTTCC CAAACGCCTC CCCTGCCCCA ATCCCTTTAT 840 

TACCCCCTCC TTCAGACACC CTCAACCTCT TCTGGCTCAA AAAGAGAATT GGGGGCTTAG 900 

GGTCGGAACC CAAGCTTAGA ACTTTAAGGA ACAAGACCAC CACTTCGAAA CCTGGGATTC 960 

AGGAATGTGT GGCCTGCACA GTGAAGTGCT GGCAACCACT AAGAATTCAA ACTGGGGCCT 1020 

CCAGAACTCA CTGGGGCCTA CAGCTTTGAT CCCTGACATC TGGAATCTGG AG ACCAGGG A 1080 

GCCTTTGGTT CTGGCCAGAA TGCTGCAGGA CTTGAGAAGA CCTCACCTAG AAATTGACAC 1140 

AAGTGGACCT TAGGCCTTCC TCTCTCCAGA TGTTTCCAGA CTTCCTTGAG ACACGGAGCC 1200 

CAGCCCTCCC CATGGAGCCA GCTCCCTCTA TTTATGTTTG CACTTGTGAT TATTTATTAT 1260 

TTATTTATTA TTTATTTATT TACAGATGAA TGTATTTATT TGGGAGACCG GGGTATCCTG 1320 

GGGGACCCAA TGTAGGAGCT GCCTTGGCTC AGACATGTTT TCCGTGAAAA CGGAGGCTGA 1380 

ACAATAGGCT GTTCCCATGT AGCCCCCTGG CCTCTGTGCC TTCTTTTGAT TATGTTTTTT 1440 

AAAATATTAT CTGATTAAGT TGTCTAAACA ATGCTGATTT GGTGACCAAC TGTCACTCAT 1500 
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TGCTGACGCC TCTGCTCCCC AGGGAGTTGT GTCTGTAATC GGCCTACTAT TCAGTGGCGA 1560 
GAAATAAAGG TTGCTTAGGA AAGAA 1585 
(2) INFORMATION FOR SEQ 10 NO* 13 s 

(i) SEQUENCE CHARACTERISTICS £ 

(A) LENGTHS 30 baae pairs 

(B) TYPBs nucleic acid 

(C) STRAND ED NESS s single 

(D) TOPOLOGY s linear 

(ii) MOLECULE TYPEs DNA (genomic) 
(iii) HYPOTHETICAL s NO 
(iii> ANTI-SENSE s NO 



(xi) SEQUENCE DESCRIPTION s SEQ ID NO: 13 s 
CGGAATTTCA TGCCCCAGTT GACAACATAG 
(2) INFORMATION FOR SEQ ID NOs 14s 

(i) SEQUENCE CHARACTERISTICS s 

(A) LENGTHS 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS * single 

(D) TOPOLOGY: linear 

(£i) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL s NO 
(iii) ANTI-SENSE s NO 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 14: 
CACTCGAGAA CTTTTTCTCC TTTAGATCAT ACAA 
(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTHS 30 base pairs 

(B) TYPEs nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
CGGGAATTCA TGCCCCAGTT GACAACATAG 
(2) INFORMATION FOR SEQ ID NO: 16: 



30 
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(i) SEQUENCE CHARACTERISTICS l 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPES DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE : NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
GAGCTCGAGT GTCACAGACT TCTTTTCCA 29 
(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
AAACGAATTC CATCCAGTAA GTCCATTACT 30 
(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
GAGCTCGAGT GTCACAGACT TCTTTC 26 
(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



WO 94/04196 



PCT/GB93/01730 



79 

(ii) MOLECULE TYPE i DNA (genomic) 
(iii) HYPOTHETICAL f HO 
(iii) ANTI-SENSE 3 NO 

(xi) SEQUENCE DESCRIPTION * SEQ ID NOs 19 : 
6C6GCCGC6C ATGTACA6CA TGCAGCTCCC A 
(2) INFORMATION FOR SEQ ID NO* 20s 

(i) SEQUENCE CHARACTERISTICS S 

(A) LENGTH: 30 base pairs 

(B) TYPEs nucleic acid 

(C) STRANDEDNESS s single 

(D) TOPOLOGY s linear 

(ii) MOLECULE TYPEs DNA (genomic) 
(iii) HYPOTHETICAL s NO 
(iii) ANTI-SENSE s NO 



(xi) SEQUENCE DESCRIPTIONS SEQ ID NOs 20s 
GOGGCCGCTA AATAAATAGA GACCCTTATG 
(2) INFORMATION FOR SEQ ID NO: 21s 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTHS 1011 base pairs 

(B) TYPEs nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPEs cDNA to mRNA 
(iii) HYPOTHETICAL S NO 
(iii) ANTI-SENSE s NO 



(xi) SEQUENCE DESCRIPTIONS SEQ ID NOs 21s 
GGTTTATTTT CCAGATGCAA TCAATGCCCC AGTCACCTGC TGTTATAACT TCACCAATAG 
GAAGATCTCA GTGCAGAGGC TCGCGAGCTA TAGAAGAATC ACCAGCAGCA AGTGTCCCAA 
ACAAGCTGTG ATGTGAGTTC AGCACACCAA CCTTCCCTGG CCTGAAGTTC TTCCTTGTGG 
AGCAAGGGAC AAGCCTCATA AACCTAGAGT CAGAGAGTGC ACTATTTAAC TTAATGTACA 
AAGGTTCCCA ATGGGAAAAC TGAGGCACCA AGGGAAAAAG TGAACCCCAA CATCACTCTC 
CACCTGGGTG CCTATTCAGA ACACCCAATT TCTTTAGCTT GAAGTCAGGA TGGCTCCACC 
TGGACACCTA TAGGAGCAGT TTGCCCTGGG TTCCCTCCTT CCACCTGCGT TCCTCCTCTA 
GCTCCCATGG CAGCCCTTTG GTGCAGAATG GGCTGCACTT CTACACCAAA ACTGCAAAGG 



60 
120 
180 
240 
300 
360 
420 
480 
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AACTTCATCT 


AACTCTGTCC 


TCCCTCCCCA 


CAGCTTACAG ACCATTGTGG CAAGGAGATC 


540 




CCAAGCAGAA 


GTGGGTTCAG 


uniilWwniuv AWWiwUiwwn l*JUU*WnAAU*# 


DUU 


CAAACTCCGA 


AGACTTGAAC 


ACTCACTCCA 


WAAVWLAAUA AlLiVUAvUl AAWi iAl 111 


AAA 


X vvWlnwvl A 


TCCCCAGACA 


CCTTGTTTAT 


TTTATTATAA TGAATTTTGT TTGTTGATGT 




GAAACATTAT 


GCdTTAAGTA ATGTTAATTC 


TTATTTAAGT TATTGATGTT TTAAGTTTAT 


780 


CTTTCATGGT 


ACTAGTGTTT 


TTTAGATACA 


GAGACTTGGG GAAATTGCTT TTCCTCTTGA 


S40 


ACCACAGTTC 


TACCCCTGGG 


ATGTTTTGAG 


GGTCTTTGCA AGAATCATTA ATACAAAGAA 


900 


TTTTTTTTAA 


CATTCCAATG 


CATTGCTAAA ATATTATTGT GGAAATGAAT ATTTTGTAAC 


960 


TATTACACCA 


AATAAATATA TTTTTGTACA AAAAAAAAAA AAAAAAAAAA A 


1011 



(2) INFORMATION FOR SBQ ID NO* 22 5 

(1) SEQUENCE CHARACTERISTICS « 

(A) LENGTH: 3194 base pairs 

(B) TYPEs nucleic acid 

(C) STRANDEDNESS * double 

(D) TOPOLOGY* linear 

(ii) MOLECULE TYPES cDNA to roRNA 
(iii) HYPOTHETICAL I NO 
(iii) ANTI-SENSES NO 



(xi) SEQUENCE DESCRIPTIONS SEQ ID NO: 22s 



AAGCTTGCTG 


AGAGTGGCTG 


CAGTCTCGCT 


GCTGGATGTG 


CACATGGTGG TCATTCCCTC 


60 


TGCTCACAGG 


GGCAGGGGTC 


CCCCCTTACT 


GGACTCAGGT 


TGCCCCCTGC TCCAGGTCCT 


120 


GGGTGGGAGC 


CCATGTGAAC 


TGTCAGTGGG 


GCAGGTCTGT 


GAGAGCTCCC CTCACACTCA 


180 


AGTCTCTCAC 


AGTGGCCAGA 


GAAGAGGAAG 


GCTGGAGTCA 


GAATGAGGCA CCAGGGCGGG 


240 


CATAGCCTGC 


CCAAAGGCCC 


CTGGGATTAC 


AGGCAGGATG 


GGGAGCCCTA TCTAAGTGTC 


300 


TCCCACGCCC 


CACCCCAGCC ATTCCAGGCC 


AGGAAGTCCA 


AACTGTGCCC CTCAGAGGGA 


360 


GGGGGCAGCC 


TCAGGCCCAT 


TCAGACTGCC 


CAGGGAGGGC 


TGGAGAGCCC TCAGGAAGGC 


420 


GGGTGGGTGG 


GCTGTCGGTT 


CTTGGAAAGG 


TTCATTAATG 


AAAACCCCCA AGCCTGACCA 


480 


CCTAGGGAAA 


AGGCTCACCG 


TTCCCATGTG 


TGGCTGATAA 


GGGCCAGGAG ATTCCACAGT 


540 


TCAGGTAGTT 


CCCCCGCCTC 


CCTGGCATTT 


TGTGGTCACC 


ATTAATCATT TCCTCTGTGT 


600 


ATTTAAGAGC 


TCTTTTGCCA 


GTGAGCCCAG 


TACACAGAGA 


GAAAGGCTAA AGTTCTCTGG 


660 


AGGATGTGGC 


TGCAGAGCCT 


GCTGCTCTTG 


GGCACTGTGG 


CCTGCAGCAT CTCTGCACCC 


720 


GCCCGCTCGC 


CCAGCCCCAG 


CACGCAGCCC 


TGGGAGCATG 


TGAATGCCAT CCAGGAGGCC 


780 


CGGCGTCTCC 


TGAACCTGAG 


TAGAGACACT 


GCTGCTGAGA 


TGGTAAGTGA GAGAATGTGC 


840 


GCCTGTGCCT 


AGGCCACCCA 


GCTGGCCCCT 


GACTGGCCAC 


GCCTGTCAGC TTGATAACAT 


900 
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GACATTTTCC TTTTCTAGAG AATGAAACAG TAGAAGTCAT CTCAGAAATG TTTGACCTCC 960 

AGGTAAGATG CTTCTCTCTG ACATAGCTTT CCAGAAGCCC CTGCCCTGGO GTGGAGGTGG 1020 

GGACTCCATT TTAGATGGCA CCACACAGGG TTGTCCACTT TCTCTCCAGT CAGCTGGCTG 1080 

CAGGAGGAGG GGGTAGCAAC TGGGTGCTGA. AGAGGCTGCT GGCCGTGCCC CTATGGCAGT 1140 

CACATGAGCT CCTTTATCAG CTGAGCGGCC ATGGGCAGAC CTAGCATTCA ATGGCCAGGA 1200 

GTCACCAGGG GACAGGTGGT AAAGTGGGGG TCACTTCATG AGACAGGAGC TGTGGGTTTG 1260 

GGGCGCTCAC TGTGCCCCGA GACCAAGTCC TGTTGAGACA GTGCTGACTA CAGAGAGGGA 1320 

CAGACGGGTT TCAGGAACAA CCCTTGCCCA CCCAGCAGGT CCAGGTCAGG CCCCACCCCC 1380 

CTCTCCCTGA ATGATGGGGT GAGAGTGACC TCCTTCCCTA AGGCTGGGCT CCTCTCCAGG 1440 

TGCCGCTGAG GGTGGCCTGG GCGGGGCAGT GAGAAGGGGA GGTTCGTGCC TGCCATGGAC 1500 

AGGGGAGGGT CTATGACTGG ACCCACGCTG TGCCCCTCCC AAGCCCTACT CCTGGGGGCT 1560 

GGGGGCAGCA GCAAAAAGGA GTGGTGGAGA CTTCTTGTAC GACTGTGGGC ACTTGGCCAC 1620 

TGCTCACCGA CGAAOGACAT TTTCCACAGG AGCOGACCTG CCTACAGACp CGCCTGGAGC 1680 

TGTACAAGCA GGGCCTGCGG GGCAGCCTCA CCAAGCTCAA GGGCCCCTTG ACCATGATGG 1740 

CCAGCCACTA CAAGCAGCAC TGCCCTCCAA CCCCGGTGAG TGCCTACGGC AGGGCCTCCA 1800 

GCAGGAATGT CTTAATCTAG GGGGTGGGGT CGACATGGGG AGAGATCTAT GGCTGTGGCT I860 

GTTCAGGACC CCAGGGGGTT TCTGTGCCAA CAGTTATGTA ATGATTAGCC CTCCAGAGAG 1920 

GAGGGAGACA GCCCATTTCA TCCCAAGGAG TCAGAGCCAC AGAGCGCTGA AGCCCACAGT 1980 

GCTCCCCAGC AGGAGCTGCT CCTATCCTGG TCATTATTGT CATTACGGTT AATGAGGTCA 2040 

GAGGTGAGGG CAAACCCAAG GAAACTTGGG GCCTGCCCAA GGCCCAGAGG AAGTGCCCAG 2100 

GCCGAAGTOC CACCTTCTGG CAGGACTTTC CTCTGGCCCC ACATGGGGTG CTTGAATTGC 2160 

AGAGGATCAA GGAAGGGAGG CTACTTGGAA TGGACAAGGA CCTCAGGCAC TCCTTCTGOG 2220 

GGAAGGGAGC AAAGTTTGTG GCCTTGACTC CACTCCTTCT GGGTGCCCAG AGACGACCTC 2280 

AGCCCAGCTG CCCTGCTCTG CCCTGGGACC AAAAAGGCAG GCGTTTGACT GCCCAGAAGG 2340 

CCAACCTCAG GCTGGCACTT AAGTCAGGCC CTTGACTCTG GCTGCCACTG GCAGAGCTAT 2400 

GCACTCCTTG GGGAACACGT GGGTGGGAGC AGCGTCACCT GACCCAGGTC AGTGGGTGTG 2460 

TCCTGGAGTG GGCCTCCTGG CCTCTGAGTT CTAAGAGGCA GTAGAGAAAC ATGCTGGTGC 2520 

TTCCTTCCCC CACGTTACCC ACTTGCCTGG ACTCAAGTGT TTTTTATTTT TCTTTTTTTA 2580 

AAGGAAACTT CCTGTGCAAC CCAGATTATC ACCTTTGAAA GTTTCAAAGA GAACCTGAAG 2640 

GACTTTCTGC TTGTCATCCC CTTTGACTGC TGGGAGCCAG TCCAGGAGTG AGACCGGCCA 2700 

GATGAGGCTG GCCAAGCCGG CGAGCTGCTC TCTCATGAAA GAAGAGCTAG AAACTCAGGA 2760 

TGGTCATCTT GGAGGGACCA AGGGGTGGGC CACAGCCATG GTGGGAGTGG CCTGGACCTG 2820 

CCCTCGGCCA CACTGACCCT GATACAGGCA TGGCAGAAGA ATGGGAATAT TTTATACTGA 2880 
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CAGAAATCAG TAATATTTAT ATATTTATAT TTTTAAAATA TTTATTTATT TATTTATTTA 
AGTTCATATT CCATATTTAT TCAAGATGTT TTACCGTAAT AATTATTATT AAAAAT ATGC 
TTCTACTTGT CCAGTGTTCT AGTTTGTTTT TAACGATGAG CAAATGCCAG TGGGTGCCTG 
CCTTCCCATG AGGCAGGGGA GGGAGGAAAC GGGGAGGTGG AGAGGGGGOG GGGGCCTCCC 
AGGCGTTGGG CACTATCCAA GGGCCAACAC TGTCAGAGCA GAGGGGAGGT GAGAGCCGGG 
CATAGTCGGA ATTC 

(2) INFORMATION FOR SEQ ID NO: 23s 

(i) SEQUENCE CHARACTERISTICS* 

(A) LENGTHS 1491 base pairs 

(B) TYPE t nucleic acid 

(C) STRANDEDNESS t double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPEs cDNA to mRNA 
(iii) HYPOTHETICAL s NO 
(iil) ANTI-SENSE* NO 
(xi) SEQUENCE DESCRIPTIONS SEQ ID NO: 23* 
CCAAAGAAAA AGTGATTTGT CATTGCTTTA TAGACTGTAA GAAGAGAACA TCTCAGAACT 
GGAGTCTTAC CCTGAAATCA AAGGATTTAA AGAAAAAGTG GAATTTTTCT TCAGCAAGCT 
GTGAAACTAA ATCCACAACC TTTGGAGACC CAGGAACACC CTCCAATCTC TGTGTGTTTT 
GTAAACATCA CTGGAGGGTC TTCTACGTGA GCAATTGGAT TGTCATCAGC CCTGCCTGTT 
TTGCACCTGG GAAGTGCCCT GGTCTTACTT GGGTCCAAAT TGTTGGCTTT CACTTTTGAC 
CCTAAGCATC TGAAGCCATG GGCCACACAC GGAGGCAGGG AACATCACCA TCCAAGTGTC 
CATACCTCAA TTTCTTTCAG CTCTTGGTGC TGGCTGGTCT TTCTCACTTC TGTTCAGGTG 
TTATCCACGT GACCAAGGAA GTGAAAGAAG TGGCAACGCT GTCCTGTGGT CACAATGTTT 
CTGTTGAAGA GCTGGCACAA ACTCGCATCT ACTGGCAAAA GGAGAAGAAA ATGGTGCTGA 
CTATGATGTC TGGGGACATG AATATATGGC CCGAGTACAA GAACCGGACC ATCTTTGATA 
TCACTAATAA CCTCTCCATT GTGATCCTGG CTCTGCGCCC ATCTGACGAG GGCACATACG 
AGTGTGTTGT TCTGAAGTAT GAAAAAGACG CTTTCAAGCG GGAACACCTG GCTGAAGTGA 
CGTTATCAGT CAAAGCTGAC TTCCCTACAC CTAGTATATC TGACTTTGAA ATTCCAACTT 
CTAATATTAG AAGGATAATT TGCTCAACCT CTGGAGGTTT TCCAGAGCCT CACCTCTGCT 
GGTTGGAAAA TGGAGAAGAA TTAAATGCCA TCAACACAAC AGTTTCCCAA GATCCTGAAA 
CTGAGCTCTA TGCTGTTAGC AGCAAACTGG ATTTCAATAT GACAACCAAC CACAGCTTCA 
TGTGTCTCAT CAAGTATGGA CATTTAAGAG TGAATCAGAC CTTCAACTGG AATACAACCA 
AGCAAGAGCA TTTTCCTGAT AACCTGCTCC CATCCTGGGC CATTACCTTA ATCTCAGTAA 
ATGGAATTTT TGTGATATGC TGCCTGACCT ACTGCTTTGC CCCAAGATGC AGAGAGAGAA 



2940 
3000 
3060 
3120 
3180 
3194 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
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GGAGGAATGA GAGATTGAGA AGGGAAAGTG TAOGCCCTGT ATAACAGTGT CCGCAGAAGC 1200 

AAGGGGCTGA AAAGATCTGA AGGTAGCCTC CGTCATCTCX TCTGGGATAC ATGGATCGTC 1260 

GGGATCATGA GG CATTCTTC CCTTAACAAA TTTAAGCTGT TTTACCCACT ACCTCACCTT 1320 

CTTAAAAACC TCTTTCAGAT TAAGCTGAAC . AGTTACAAGA TGGCTGGCAT CCCTCTCCTT 1380 

TCTCCCCATA TGCAATTTGC TTAATGTAAC CTCTTCTTTT GCCATGTTTC CATTCTGCCA 1440 

TCTTGAATTG TCTTGTCAGC CAATTCATTA TCTATTAAAC ACTAATTTGA 6 1491 
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1. A DNA construct comprising (i) means for expression of a coding 
sequence in a tumour cell and <ii) a said coding sequence 

5 encoding a cytokine. 

2. A construct according to Claim 1 wherein the said means for 
expression provides for specific expression selectively in tumour 
cells. 

10 

3. A construct according to Claim 2 wherein the tumour cells are 
melanoma cells. 



4. A construct according to Claim 2 wherein the tumour cells are 
*5 breast tumour cells. 

5. A construct according to Claim 2 wherein the tumour cells are 
colon tumour cells. 

20 6. A construct according to Claim 2 wherein the tumour cells are 
pancreatic tumour cells. 

7. A construct according to Claim 2 wherein the tumour cells are 
prostate tumour cells. 

25 

8. A construct according to Claim 3 wherein the said means for 
expression is a promoter or an analogue or part thereof forming 
part of a gene expressed exclusively in the melanin synthesis 
pathway. 

30 
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9. A construct according to Claim 8 wherein the gene is tyrosinase 
or TRP-l. 

10* A construct according to Claim 4 wherein the said means for 
5 expression is provided by the c~erb-B2 gene promoter or the 

MUC1 gene promoter or the c-e/*-B3 gene promoter. 

11. A construct according to Claim 5 wherein the said means for 
expression is provided by the CEA gene promoter. 

10 

12. A construct according to Claim 6 wherein the said means for 
expression is provided by the MUC1 gene promoter. 

13. A construct according to Claim 7 wherein the said means for 
15 expression is provided by the PSA gene promoter. 

14. A construct according to any one of the preceding claims wherein 
the cytokine is interleukin-2 or interleukin-4. 

20 15. A construct according to any one of the preceding claims further 
comprising a B7 coding sequence and means for expression 
thereof in a tumour cell. 



16. A composition comprising a construct according to any one of the 
preceding claims and means for selectively delivering it to a 
tumour. 

17. A composition according to Claim 16 wherein the selective 
delivery means is a liposome carrying tumour cell targeting means 
or a retrovirus or adenovirus specific for the tumour cells. 
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18. A method of treating a tumour and/or ameliorating metastasis 
therefrom comprising delivering into cells of the tumour a 
construct according to any one of Claims 1 to 15. 

5 19. A method of treating a tumour and/or ameliorating metastasis 
therefrom comprising delivering into cells of the tumour a 
construct according to any one of Claims 1 to 15 expressing at 
least two coding sequences encoding respective cytokines wherein 
the said cytokines may be the same as or different from one 
10 another. 

20. A method of treating a tumour and/or ameliorating metastasis 
therefrom comprising delivering into cells of the tumour a 
plurality of constructs according to any one of Claims 1 to 15 

15 expressing at least two coding sequences encoding respective 

cytokines wherein the said cytokines may be the same as or 
different from one another. 

21 . A method according to Claims 19 or 20 wherein the cytokines are 
20 chosen from interleukin-2, interleukin-4, macrophage colony 

stimulating factor, interferon-7, tumour necrosis factor and 
interleukin-7. 

22. A method according to Claims 19 or 20 wherein the coding 
25 sequences encode interleukin-2, interleukin-4 and macrophage 

colony stimulating factor and are present in 1:1:1 molar ratio. 

23. A method according to any one of Claims 18 to 20 wherein the 
tumour cells are melanoma, breast, pancreas, prostate or colon 

30 cells and naked DNA is injected directly into the tumour. 
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A method according to any one of Claims 18 to 23 additionally 
comprising administering a chemotherapeutic agent 

A method according to Claim 24 wherein the chemotherapeutic 
agent is at least one of cisplatin, dacarbazine, tamoxifen, 
nitrosourea, vinca alkaloid; melphalan, doxorubicin, adriamydn, 
etoposide and 5-fluorouraciL 

A method according to any one of Claims 18 to 25 further 
comprising delivering into cells of the tumour a construct 
comprising a B7 coding region and means for expression thereof 
in a tumour cell. 

A method according to any one of Claims 18 to 25 comprising 
delivering into cells of the tumour a construct comprising a B7 
coding region and a cytokine coding region and means for 
expression thereof in a tumour cell. 
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